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s-r 1. General principles 
< 

Q\ . Classical Mechanics is a theory of point particles motions. If X = (xi, . . ., x n ) are the 

particles positions in a Cartesian inertial system of coordinates, the equations of motion 
are determined by their masses (mi, . . . , m n ), rnj > 0, and by the potential energy of 
>• ■ interaction V(xi, . . . , x n ) as 

in ■ 
oo 

^j- ■ TOjXi = -d Xi V(x±, . . . ,x n ), i = l,...,n (1.1) 

O 

here Xj = (xn, . . . , Xid) are coordinates of the z-th particle and d Xi is the gradient (d Xil , . . . , 
d Xid ); d is the space dimension (i.e. d = 3, usually). The potential energy function will be 
Q_c supposed "smooth", i.e. analytic except, possibly, when two positions coincide. The latter 
exception is necessary to include the important cases of gravitational attraction or, when 
dealing with electrically charged particles, of Coulomb interaction. A basic result is that if 
V is bounded below the equation (1.1) admits, given initial data Xo = X(0),Xo = X(0), 
a unique global solution t — > X(£), t G (—oo, oo); otherwise a solution can fail to be global 
if and only if, in a finite time, it reaches infinity or a singularity point (i.e. a configuration 
in which two or more particles occupy the same point: an event called a collision). 

In Eq. (1.1) —d Xi V(x.i, . . . ,x n ) is the force acting on the points. More general forces 
are often admitted. For instance velocity dependent friction forces: they are not consid- 
ered here because of their phenomenological nature as models for microscopic phenomena 
which should also, in principle, be explained in terms of conservative forces (furthermore, 
even from a macroscopic viewpoint, they are rather incomplete models as they should 
be considered together with the important heat generation phenomena that accompany 
them). Another interesting example of forces not corresponding to a potential are certain 
velocity dependent forces like the Coriolis force (which however appears only in non iner- 
tial frames of reference) and the closely related Lor entz force (in electromagnetism): they 
could be easily accomodated in the upcoming Hamiltonian formulation of mechanics, see 
Appendix A2. 

The action principle states that an equivalent formulation of the equations (1.1) is 
that a motion t — > Xo(t) satisfying (1.1) during a time interval [£1,^2] an d leading from 
X 1 = Xo(ti) to X 2 = Xo(i2)) renders stationary the action 
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-4({X}) = ( E \™i Mt) 2 ~ V(X.{t)j) dt (1.2) 

i=l 

within the class Mt 1 ,t 2 (X 1 , X 2 ) of smooth (i.e. analytic) "motions" t — > X(t) defined for 
t G [^1,^2] and leading from X 1 to X 2 . 

The function £(Y, X) = \ £™ = i m t y 2 - V(X) = f K(Y) - V(X), Y = (y 1} . . . , y n ), is 

called the Lagrangian function and the action can be written as C(X(t), X(t)) dt. The 

quantity K(X.(t)) is called kinetic energy and motions satisfying (1.1) conserve energy as 
time t varies, i.e. 

K(X(t)) +V(X(t)) = E = const (1.3) 

Hence the action principle can be intuitively thought of as saying that motions proceed 

by keeping constant the energy, sum of the kinetic and potential energies, while trying to 

share as evenly as possible their (average over time) contribution to the energy. 

In the special case in which V is translation invariant motions conserve linear mo- 
de f 

mentum Q = £V rriiXi; if V is rotation invariant around the origin O motions conserve 

angular momentum M d = £V vtii Xj A Xj, where A denotes the vector product in IR d , i. e. it 
is the tensor (aAb)jj = aibj — biaj, i,j = 1, . . . , d: if the dimension d = 3 the aAb will be 
naturally regarded as a vector. More generally to any continuous symmetry group of the 
Lagrangian correspond conserved quantities: this is formalized in the Noether theorem. 

It is convenient to think that the scalar product in R an is defined in terms of the 
ordinary scalar product in ]R d , a • b = Yl,j=\ a 3^ji by (v, w) = Y17=i m i v i ' w * : so that 
kinetic energy and line element ds can be written as K(X) = |(X, X) and, respectively, 
ds 2 = Xir=i TO ^ x i- Therefore the metric generated by the latter scalar product can be 
called kinetic energy metric. 

The interest of the kinetic metric appears from the Maupertuis ' principle ( equivalent 
to (1.1)): the principle allows us to identify the trajectory traced in ]R d by a motion that 
leads from X 1 to X 2 moving with energy E. Parameterizing such trajectories as r — > X(r) 
by a parameter r varying in [0, 1] so that the line element is ds 2 = (<9 T X, d T ~K.)dr 2 , the 
principle states that the trajectory of a motion with energy E which leads from X 1 to X 2 
makes stationary, among the analytic curves £ G 7Wo,i(X 1 , X 2 ), the function 

L(£) = J y/E-V(£( 8 ))d8 (1.4) 

so that the possible trajectories traced by the solutions of (1.1) in M nd and with energy E 
can be identified with the geodesies of the metric dm 2 d = (E — V(X)) ■ ds 2 . 

References: [LL68], [Ga83] 
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2. Constraints 



Often particles are subject to constraints which force the motion to take place on a 
surface M C M nd : i.e. X(£) is forced to be a point on the manifold M. A typical example 
is provided by rigid systems in which motions are subject to forces which keep the mutual 
distances of the particles constant: |xj — Xj | = pij with pij time independent positive quan- 
tities. In essentially all cases the forces that imply constraints, called constraints reactions, 
are velocity dependent and, therefore, are not in the class of conservative forces considered 
here, cf. (1.1). Hence, from a fundamental viewpoint admitting only conservative forces, 
constrained systems should be regarded as idealizations of systems subject to conservative 
forces which approximately imply the constraints. 

In general the ^-dimensional manifold M will not admit a global system of coordinates: 
however it will be possible to describe points in the vicinity of any X° e M by using 
N = nd coordinates q = (q±, . . . , qt, qi+i, . . . , gjv) varying in an open ball i?x o: X = 
X(q 1 ,...,qt,q e +i, ...,gjv). 

The (/-coordinates can be chosen well adapted to the surface M and to the kinetic met- 
ric, i.e. so that the points of M are identified by q£ + i = . . . = q^ = (which is the meaning 
of "adapted") and furthermore infinitesimal displacements (0, . . . , 0, dee+i, . . . , de^) out of 
a point X° e M are orthogonal to M (in the kinetic metric) and have a length independent 
of the position of X° on M (which is the meaning of 'well adapted" to the kinetic metric). 

Motions constrained on M arise when the potential V has the form 

V(X) = V a (K) + XW(X) (2.1) 

where W is a smooth function which reaches its minimum value, say equal to 0, precisely 
on the manifold M while V a is another smooth potential. The factor A > is a parameter 
called the rigidity of the constraint. 

A particularly interesting case arises when, furthermore, the level surfaces of W have 
the geometric property of being "parallel" to the surface M: in the precise sense that 
the matrix dq. q W(X.), i,j>£ is positive definite and X-independent for all X e M in a 
system of coordinates well adapted to the kinetic metric. 

A potential W with the latter properties can be called an approximately ideal con- 
straint reaction. In fact it can be proved that, given an initial datum X° e M with velocity 
X° tangent to M, i.e. given an initial datum whose coordinates in a local system of coor- 
dinates are (q , 0) and (q , 0) with q = («? i, • • • , Qoi) and q = (<?oi, • • • , qot), the motion 
generated by (1.1) with V given by (2.1) is a motion t — > X A (t) which 

(1) as A — > oo tends to a motion t — > X^i), 

(2) as long as Xoo(t) stays in the vicinity of the initial data, say for < t < t±, so that 
it can be described in the above local adapted coordinates, its coordinates have the form 
t — > (q(i), 0) = (qi(t), . . . , qe(t), 0, . . . , 0): i.e. it is a motion developing on the constraint 
surface M, 

(3) the curve t — > Xoo(t), t G [0, ti], as an element of the space Aio,^ (X°, Xqo^i)) of 
analytic curves on M connecting X° to Xoo(ti), renders the action 
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stationary. 



A(X)= f 1 (K(X(t))-V a (X(t)))dt 
Jo 



(2.2) 



The latter property can be formulated "intrinsically", i.e. referring only to M as a 
surface, via the restriction of the metric ds 2 to line elements ds = (dqi, . . . , dqi, 0, . . . , 0) 
tangent to M at the point X = (q , 0, . . . , 0) G M; we write ds 2 = Yli'j diji^dqi dqj. The 
i x I symmetric positive definite matrix g is called the metric on M induced by the kinetic 
metric. Then the action in (2.2) can be written as 

rti n M 

A ^ = J (zYs^Mmmw-Vaiciittydt (2.3) 

where V^ a (q) = V a (X(g 1 , . . . , q£, 0, . . . , 0)): the function 

defl ^ - 1 - 

£(»7,q) = 2 ^29iM)viVj - Va(q) = ^(q^^- v a (q) (2.4) 

is called the constrained Lagrangian of the system. 

An important property is that the constrained motions conserve the energy defined 
as E = §(#(q)q,q) + V (q), see Sec. 3. 

The constrained motion Xoo (t) of energy i? satisfies the Maupertuis' principle in the 
sense that the curve on M on which the motion develops renders 

L($ = J y/E-V a ($(s))ds (2.5) 

stationary among the (smooth) curves that develop on M connecting two fixed values 
Xi and X2. In the particular case in which £ = n this is again Maupertuis' principle 
for unconstrained motions under the potential V(X). In general I is called the number of 
degrees of freedom because a complete description of the initial data requires 21 coordinates 
q(0),q(0). 

If W is minimal on M but the condition on W of having level surfaces parallel to M 
is not satisfied, i.e. if W is not an approximate ideal constraint reaction, it remains still 
true that the limit motion 'X. 00 (t) takes place on M. However, in general, it will not satisfy 
the above variational principles. For this reason motions arising as limits as A — > 00 of 
motions developing under the potential (2.1) with W having minimum on M and level 
curves parallel (in the above sense) to M are called ideally constrained motions or motions 
subject by ideal constraints to the surface M. 

As an example suppose that W has the form W(X) = - £V Wij (|xj — Xj|) with 
w ij(\£,\) > an analytic function vanishing only when |£| = Pij for i,j in some set of pairs 
V and for some given distances (for instance = (£ 2 — p 2 j) 2 7, 7 > 0). Therefore 

the so constrained motions X^ (t) of the body satisfy the variational principles mentioned 
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in connection with (2.3) and (2.5): in other words the above natural way of realizing a 
rather general rigidity constraint is ideal. 

This is the modern viewpoint on the physical meaning of the constraints reactions: 
looking at motions in an inertial cartesian system it will appear that the system is subject 
to the applied forces with potential V a (X) and to constraint forces which are defined as 
the differences Ri = + 9 Xi V r a (X). The latter reflect the action of the forces with 

potential AVF(X) in the limit of infinite rigidity (A — > oo). 

In applications sometimes the action of a constraint can be regarded as ideal: so that 
the motion will verify the mentioned variational principles and the R can be computed as 
the differences between the mjXj and the active forces — d Xi V a (X.). In dynamics problems 
it is, however, a very difficult and important matter, particularly in engineering, to judge 
whether a system of particles can be considered as subject to ideal constraints: this leads 
to important decisions in the construction of machines. It simplifies the calculations of the 
efforts of the materials but a misjudgement can have serious consequences about stability 
and safety. For statics problems the difficulty is of lower order: usually assuming that the 
constraints reaction is ideal leads to an overestimate of the requirements for stability of 
equilibria. Hence employing the action principle to statics problems, where it constitutes 
the virtual works principle, generally leads to economic problems rather than to safety 
issues. Its discovery even predates Newtonian mechanics. 

References: [Ar68], [Ga83]. 

3. Lagrange and Hamilton form of the equations of motion 

The stationarity condition for the action A(q), cf. (2. 3), (2. 4), is formulated in terms 
of the Lagrangian £(77, £), see (2.4), by 

-^£(q(t), q(f)) = d qi C(q(t), q(f)), i = 1, . . . , I (3.1) 

which is a second order differential equation called Lagrangian equation of motion. It can 
be cast in "normal form": for this purpose, adopting the convention of "summation over 
repeated indices" , introduce the "generalized momenta" 

Pi d - ' gfakjQj, i = l,...,£ (3.2) 

Since g(q) > the motions t — > q(i) and the corresponding velocities t — > q(£) can be 
described equivalently by t — > (q(t),p(t)): and the equations of motion (3.1) become the 
first order equations 

q t = d Pi H(p, q), pi = -d Qi H(p, q) (3.3) 
where the function Tl, called Hamiltonian of the system, is defined by 
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H(p,q) = / -(^(q)- 1 p,p) + ^(q) 



(3.4) 



Eq. (3.3), regarded as equations of motion for phase space points (p, q), are called Hamilton 
equations. In general q are local coordinates on M and motions are specified by giving 
q, q or p, q. 

Looking for a coordinate free representation of motions consider the pairs X, Y with 
X e M and Y a vector Y e Tx tangent to M at the point X. The collection of pairs 
(Y,X) is denoted T(M) = U X6 m(7x x {X}) and a motion t -> (X(t),X(t)) e T(M) in 
local coordinates is represented by (q(t), q(£))- The space T(M) can be called the space of 
initial data for Lagrange's equations of motion: it has 2£ dimensions (more fancily known 
as the tangent bundle of M). 

Likewise the space of initial data for the Hamilton equations will be denoted T* (M) 
and it consists in pairs X, P with X € M and P = g(X) Y with Y a vector tangent to M 
at X. The space T*(M) is called the phase space of the system: it has 2£ dimensions (and 
it is occasionally called the cotangent bundle of M). 

Immediate consequence of (3.3) is 4rH(p(t), q(t)) = and it means that H(p(t), q(t)) 
is constant along the solutions of the (3.3). Remarking that H(p, q) = §(<7(q)q, q) + ^ a (q) 
is the sum of the kinetic and potential energies it follows that the conservation of H along 
solutions has the meaning of energy conservation in presence of ideal constraints. 

Let St be the flow generated on the phase space variables (p, q) by the solutions of 
the equations of motion (3.3), i.e. let t — > St(p, q) = (p(t), q(£)) denote a solution of (3.3) 
with initial data (p, q). Then a (measurable) set A in phase space evolves in time t into a 
new set St A with the same volume: this is obvious because the Hamilton equations (3.3) 
have manifestly divergence ("Liouville's theorem"). 

The Hamilton equations also satisfy a variational principle, called Hamilton action 
principle: i.e. if -Mt l5 t 2 ((pi, qi), (p2,q2);-&0 denotes the space of the analytic functions 
if : t — > (iz(t), n(t)) which in the time interval [£1,^2] lead from (pi,qi) to (p2,q2), then 
the condition that (po(t) = (p(£),q(t)) satisfies (3.3) can be equivalently formulated by 
requiring that the function 



be stationary for ip = <p : in fact the (3.3) are the stationarity conditions for the Hamilton 
action (3.5) on M.t M ((Pi> qi)) (P25 q2)j M). And since the derivatives of n(t) do not 
appear in (3.5) stationarity is even achieved in the larger space -Mti,t 2 ^2; M) of the 
motions ip : t — > (7r(t), #«(£)) leading from qi to q2 without any restriction on the initial 
and final momenta pi,P2 (which therefore cannot be prescribed a priori independently 
of qi,q2)- If the prescribed data Pi,qi,P2,q2 are not compatible with the equations of 
motion, e.g. i?(pi, (I2) 7^ H(p 2 , q2), then the action functional has no stationary trajectory 
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(3.5) 



in M tl ,t 2 ({pi, qi), (p 2 q2); M). 



References: [LL68], [Ar68], [Ga83] 

4. Canonical transformations of phase space coordinates 

The Hamiltonian form, (3.4), of the equations of motion turns out to be quite useful 
in several problems. It is therefore important to remark that it is invariant under a special 
class of transformations of coordinates, called canonical transformations. 

Consider a local change of coordinates on phase space, i.e. a smooth smoothly in- 
vertible map C(7r, k) = (n', k') between an open set U in the phase space of a £ degrees 
of freedom Hamiltonian system, into an open set U' in a 2£ dimensional space. The 
change of coordinates is said canonical if for any solution t — > (ir(t), of equations 
like (3.3), for any Hamiltonian H(7T,k) defined on U, the C-image t — > (7r'(t), «'(£)) = 
C(7r(£), n(t)) are solutions of the (3.3) with the "same" Hamiltonian, i.e. with Hamiltonian 

W(it',k') ^nip-^.K')). 

The condition that a transformation of coordinates is canonical is obtained by using 
the arbitrariness of the function Ti and is simply expressed as a necessary and sufficient 
property of the Jacobian L 



(4.1) 



where i,j = !,...,£. Let E = ( ) denote the 2£ x 2£ matrix formed by four 



1 

£ x £ blocks, equal to the matrix or, as indicated, to the ± identity matrix; then, if a 
superscript T denotes matrix transposition, the condition that the map be canonical is 
that 

L~ 1 =EL T E T or L _1 = ^ —(j T ~A?) (42) 

which immediately implies that det L = ±1. In fact it is possible to show that (4.2) implies 
det L = 1. The (4.2) is equivalent to the four relations AD T — BC T = 1, -AB T +BA T = 0, 
CD T - DC T = 0, -CB T + DA T = 1. More explicitly, since the first and the fourth 
coincide, the relations are 

{n'iiKj} = 6ij, {7r-,7Tj} = 0, = (4.3) 

where, for any two functions F(ir, k), G(tt, k), the Poisson bracket is 



{F, G}(tt, k) = f J2 ( d « k F (n, «) d Kk G(*> K ) " d Kh F(n, k) d Kh G(*> K )) ( 4 - 4 ) 

k=i 
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The latter satisfies Jacobi's identity: + {{G,Q},F} + {{Q,F},G} = 0, 

for any three functions F, G, Q on phase space. It is quite useful to remark that if t — > 
(p(t), q(t)) = St(p, q) is a solution to Hamilton equations with Hamiltonian 7i then, given 

def 

any observable F(p,q), it "evolves" as F(t) = F(p(t), q(t)) satisfying d t F(p(t), q(t)) = 
{7i, F}(p(t), q(t)). Requiring the latter identity to hold for all observables F is equivalent 
to requiring that the t — > (p(t), q(t)) be a solution of Hamilton's equations for 7Y. 

Let C : U < — > U' be a smooth, smoothly invertible, transformation between two 2£ 
dimensional sets: C(7r, re) = (ir', re'). Suppose that there is a function $(7r', re) defined on 
a suitable domain W and such that 

c(-.-)=(-'.-')^{r=tT<i) (45) 

then C is canonical. This is because (4.5) implies that if re, n' are varied and if 7r, re', 7r', re 
are related by C(7r, re) = (n', re'), then 7r • dn + re' ■ dn' = d$>(n', re): which implies 



7T • dK - H(n, k)dt = 7r' • dre' - W(C _1 (7r', n'))dt + d$(7r', re) - d(7r' • re') (4.6) 

and means that the Hamiltonians H(p,q)) and W(p',q')) d =H(C~ 1 (p', q')) have Hamil- 
ton actions and differing by a constant, if evaluated on corresponding motions 
(p(t),q(t)) and (p'(*),q'(*)) = C(p(t), q(t)). 

The constant depends only on the initial and final values (p(ii), q(£i)) and (p(^2), 
q(t 2 )) and, respectively, (p'(ti), q'(ti)) and (p'(t 2 ), q'(^)) so that if (p(t), q(t)) makes 
„4^ extreme also (p'(£), q'(t)) = C(p(t), q(t)) makes A«/ extreme. 

Hence if t — > (p(t),q(t)) solves the Hamilton equations with Hamiltonian 7i(p, q) 
then the motion t — > (p'(t), q'(t)) = C(p(t),q(t)) solves the Hamilton equations with 
Hamiltonian 7i'(p', q') = 7i(C _1 (p', q')) no matter which is the function H: therefore the 
transformation is canonical. The function $ is called its generating function. 

Eq. (4.5) provides a way to construct canonical maps. Suppose that a function 

$(7r', re) is given and defined on some domain W: then setting < 77 . ', s and 

v ; [ re' = dn'^iir, re) 

inverting the first equation in the form iz' = 3(7r, re) and substituting the value for 7r', 

thus obtained, in the second equation, a map C(ir, re) = (77', re') is defined on some domain 

(where the mentioned operations can be performed) and if such domain is open and not 

empty then C is a canonical map. 

For similar reasons if T(re, re') is a function defined on some domain then setting 
7T = <9«T(re, re'), 7r' = — d K 'T(n, re') and solving the first relation to express re' = A(ir, re) 
and substituting in the second relation a map (n', re') = C(7r, re) is defined on some domain 
(where the mentioned operations can be performed) and if such domain is open and not 
empty then C is a canonical map. 

Likewise canonical transformations can be constructed starting from a priori given 
functions F(tt,k') or G(7r,7r'). And the most general canonical map can be generated 
locally (i.e. near a given point in phase space) by a single one of the above four ways, 
possibly composed with a few "trivial" canonical maps in which one pair of coordinates 
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(71-j, Ki) is transformed into (— Kj,7Tj). The necessity of including also the trivial maps can 
be traced to the existence of homogeneous canonical maps, i.e. maps such that 7r • dn = 
7r' • dn' {e.g. the identity map, see below or (9.7) for nontrivial examples) which are action 
preserving hence canonical, but which evidently cannot be generated by a function k') 
although they can be generated by a function depending on n' , n. 

Simple examples of homogeneous canonical maps are maps in which the coordinates 
q are changed into q' = R(q) and, correspondingly, the p's are transformed as p' = 

(<9 q R(q)) 1 T p, linearly: indeed this map is generated by the function F(p' , q) d = p'-R(q). 
For instance consider the map "Cartesian-polar" coordinates (qi,q2) i — > (p, 0) with 

(p, 9) the polar coordinates of q (namely p = y/qf + q%, 9 = atan qijqi) and let n d = q/ |q| 

def def 

= (ni,n2) and t = (-n 2 ,ni). Setting p p = p • n, pg = pp ■ t, the map (pi,P2,qi,q2) i — ► 
(p p , po, p, 9) is homogeneous canonical (because p • <iq = p • n dp + p • t p d9 = p p dp +pg d9) . 

As a further example any area preserving map (p,q)< — >{p',q') defined on an open 
region of the plane K 2 is canonical: because in this case the matrices A, B, C, D are just 
numbers which satisfy AD — BC = 1 and, therefore, the (4.2) holds. 



References: [LL68], [Ga83] 
5. Quadratures 

The simplest mechanical systems are integrable by quadratures. For instance the 
Hamiltonian on 1R 2 

n(p,q) = ^p 2 + V(q) (5.1) 

generates a motion t — > q(t) with initial data go, qo such that 7i(po 5 qo) = E, i.e. \mq^ + 
V(q ) = E, satisfying q(t) = ±y / '^(E — V(q(t))). If the equation E = V(q) has only two 
solutions q~(E) < q+(E) and \d q V(q±(E)) \ > the motion is periodic with period 

^) = 2/-<f> 7¥f A f5r (5.2) 

The special solution with initial data qo = q-(E), qo — will be denoted Q(t) and it is an 
analytic function (by the general regularity theorem on ordinary differential equations). 
For < t < f or for J < t < T it is given, respectively, by 

The most general solution with energy E has the form q(t) = Q(to +t) where to is defined 
by qo = Q(to),qo = 0(^o) 5 i- e - it is the time needed to the "standard solution" Q(t) to 
reach the initial data for the new motion. 

If the derivative of V vanishes in one of the extremes or if one at least of the two 
solutions q±(E) does not exist the motion is not periodic and it may be unbounded: 



3//ebbraio/2008; 13:37 



9 



nevertheless it is still expressible via integrals of the type (5.2). If the potential V is periodic 
in q and the variable q is considered varying on a circle then essentially all solutions are 
periodic: exceptions can occurr if the energy E has a value such that V(q) = E admits a 
solution where V has zero derivative, 

Typical examples are the harmonic oscillator, the pendulum, the Kepler oscillator. 
whose Hamiltonians, if m, u>, g, h, G, k are positive constants, are respectively 



V , 1 22 
— muj q , 

2m 2 y ' 



V 



1 



G 2 



h mg(l — cos — ), mk—+m — = 

2m h 2m \q\ 2q z 



(5.4) 



the latter has a potential which is singular at q = but if G ^ the energy conservation 
forbids too close an approach to q = and the singularity becomes irrelevant. 

The integral in (5.3) is called a quadrature and the systems in (5.1) are therefore 
integrable by quadratures. Such systems, at least in the cases in which the motion is 
periodic, are best described in new coordinates in which periodicity is more manifest. 
Namely when V(q) = E has only two roots q±(E) and =pV' (q±(E)) > the energy-time 
coordinates can be used by replacing q, q or p, q by E, r where r is the time needed to 
the standard solution t — > Q(t) to reach the given data, i.e. Q(t) = q, Q(r) = q. In such 
coordinates the motion is simply (E 1 , r) — > {E, r + t) and, of course, the variable r has to 
be regarded as varying on a circle of radius T/2n. The E, r variables are a kind of polar 
coordinates as can be checked by drawing the curves of constant E, "energy levels" , in the 
plane p, q in the cases in (5.4), see Fig. 1. 

In the harmonic oscillator case all trajectories are periodic. In the pendulum case all 
motions are periodic except the ones which separate the oscillatory motions (the closed 
curves in the second drawing) from the rotatory motions (the apparently open curves) 
which, in fact, are on closed curves as well if the q coordinate, i.e. the vertical coordinate 
in Fig. 1, is regarded as "periodic" with period 2nh. In the Kepler case only the negative 
energy trajectories are periodic and a few of them are drawn in Fig. 1. The single dots 
represent the equilibrium points in phase space. 






(1) 



Fig. 1: The energy levels of the harmonic oscillator, the pendulum, and the Kepler motion 

The region of phase space where motions are periodic is a set of points (p, q) with the 
topological structure of U ue u({u} x C u ) where u is a coordinate varying in an open interval 
U, for instance the set of values of the energy, and C u is a closed curve whose points (p, q) 
are identified by a coordinate, for instance by the time necessary to an arbitrarily fixed 
datum with the same energy to evolve into (p, q) . 

In the above cases, (5.4), if the "radial" coordinate is chosen to be the energy the 
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set U is the interval (0, +00) for the harmonic oscillator, (0, 2mg) or (2mg, +00) for the 
pendulum, and (—^^-,0) in the Kepler case. The fixed datum for the reference motion 
can be taken, in all cases, of the form (0, qo) with the time coordinate to given by (5.3). 

It is remarkable that the energy-time coordinates are canonical coordinates: for in- 
stance in the vicinity of (po, qo) and if po > this can be seen by setting 

S(q,E)= / y / 2m(E -V(x))dx (5.5) 
Jqo 

and checking that p = d q S(q,E),t = dES(q,E) are identities if (p, q) and (E,t) are 
coordinates for the same point so that the criterion expressed by (4.6) applies. 

It is convenient to standardize the coordinates by replacing the time variable by an 
anglect = j?^ t; and instead of the energy any invertible function of it can be used. 

It is natural to look for a coordinate A = A(E) such that the map (p, q)< — >(A, a) is 
a canonical map: this is easily done as the function 

rq 

S(q,A) = j y/2m(E(A) - V{x))dx (5.6) 

generates (locally) the correspondence between p = ^2m(E(A) — V(q)) and a = E'(A) J Q q 
dx Therefore, by the criterion (4.6), if E'(A) = wItjtt, »-e. if A'(E) = 

T ^ , the coordinates (A, a) will be canonical coordinates. Hence, by (5.2), A(E) can be 
taken equal to 

A = J- 2 T +( } y/2m(E-V(q))dq = ^- I pdq (5.7) 

where the last integral is extended to the closed curve of energy E, see Fig. 1. The action- 
angle coordinates (A, a) are defined in open regions of phase space covered by periodic 
motions: in action-angle coordinates such regions have the form W = J x T of a product 
of an open interval J and a one dimensional "torus" T = [0, 2tv] (i.e. a unit circle). 



References: [LL68], [Ar68], [Ga83]. 



6. Quasi periodicity and integrability 



A Hamiltonian is called integrable in an open region W C T*(M) of phase space if 

(1) there is an analytic and non singular (i.e. with non zero Jacobian) change of coordi- 
nates (p, q)< — >(I 5 y) capping W into a set of the form 1 x T £ with 1 C M 1 (open) and 
furthermore 

(2) the flow t — > S t (p, q) on phase space is transformed into (I, ip) — > (I, (p + u>(T) t) where 
w(I) is a smooth function on X. 



This means that in suitable coordinates, that can be called "integrating coordinates" , 
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the system appears as a set of I points with coordinates ip = (<pi, . . . , <pi) moving on a 
unit circle at angular velocities = (u;i(I), . . . depending on the actions of the 

initial data. 

A system integrable in a region W which in integrating coordinates I, ip has the 
form X x T e is said anisochronous if detenu? (I) 7^ 0. It is said isochronous if w(I) = u> 
is independent on I. The motions of integrable systems are called gwasi periodic with 
frequency spectrum or with frequencies uj(I)/2ty, in the coordinates (I, </?). 

Clearly an integrable system admits £ independent constants of motion, the I = 
and, for each choice of I, the other coordinates vary on a "standard" £- 
dimensional torus T £ : hence it is possible to say that a phase space region of integr ability is 
foliated into I- dimensional invariant tori T(I) parameterized by the values of the constants 
of motion I G X. 

If an integrable system is anisochronous then it is canonically integrable: i.e. it is 
possible to define on If a canonical change of coordinates (p, q) = C(A, a) mapping W 

onto JxT £ and such that H(C(A, a)) = h(A) for a suitable h: so that, if u?(A) d = dxh(A), 
the equations of motion become 

A = 0, d = u?(A) (6.1) 

Given a system (I, cp) of coordinates integrating an anisochronous system the construction 
of action-angle coordinates can be performed, in principle, via a classical procedure (under 
a few extra assumptions). 

Let 71, . . . ,ji be £ topologically independent circles on T , for definiteness let 7* (I) = 
{<p I <px = <p 2 = ■ ■ ■ = <Pi-i = <Pi+i = • • • = 0, <fi G [0, 2tt]}, and set 

Ai(i) = ^-<f p-^q (6-2) 

If the map I< — >A(I) is analytically invertible as I = 1(A) the function 

S(A,p) = (X) /Vdq (6-3) 
Jo 

is well defined if the integral is over any path A joining the points (p(I(A), 0), (/(1(A), 0)) 
and (p(I(A), y)), g(I(A), y>) lying on the torus parameterized by 1(A). 

The key remark in the proof that (6.3) really defines a function of the only variables 
A, <p is that anisochrony implies the vanishing of the Poisson brackets (cf. (4.4)): Ij} = 
(hence also {At, Aj} = k di k di h {Ik, Ih} = 0). And the property {/j, Ij} = can be 
checked to be precisely the integrability condition for the differential form p • <iq restricted 
to the surface obtained by varying q while p is constrained so that (p, q) stays on the 
surface I=constant, i.e. on the invariant torus of the points with fixed I. 

The latter property is necessary and sufficient in order that the function S( A, ip) be 
well defined (i.e. be independent on the integration path A) up to an additive quantity of 
the form £V 2nriiAi with n = (n±, . . . , ni) integers.. 

Then the action-angle variables are defined by the canonical change of coordinates 
with S(A, <p) as generating function, i.e. by setting 
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ai = d Ai S(A,<p), Ii = d Vi S(A,<p). (6.4) 

and, since the computation of S(A, ip) is "reduced to integrations" which can be regarded 
as a natural extension of the quadratures discussed in the one dimensional cases, also such 
systems are called integrable by quadratures. The just described construction is a version 
of the more general Arnold- Liouville theorem. 

In practice, however, the actual evaluation of the integrals in (6. 2), (6. 3) can be difficult: 
its analysis in various cases (even as "elementary" as the pendulum) has in fact led to key 
progress in various domains, e.g. in the theory of special functions and in group theory. 

In general any surface on phase space on which the restriction of the differential form 
p • <iq is locally integrable is called a Lagrangian manifold: hence the invariant tori of an 
anisochronous integrable system are Lagrangian manifolds. 

If an integrable system is anisochronous it cannot admit more than £ independent 
constants of motion; furthermore it does not admit invariant tori of dimension > I. Hence 
^-dimensional invariant tori are called maximal. 

Of course invariant tori of dimension < I can also exist: this happens when the 
variables I are such that the frequencies u>(I) admit nontrivial rational relations; i.e. there 
is an integer components vector v G \ v = (yi, . . . , vg) ^ such that 

w(I)-i/ = 5^(1)^ = (6.5) 

i 

in this case the invariant torus T(I) is called resonant. If the system is anisochronous then 
det diu>(T) 7^ and, therefore, the resonant tori are associated with values of the constants 
of motion I which form a set of 0-measure in the space X but which is not empty and 
dense. 

Examples of isochronous systems are the systems of harmonic oscillators, i.e. systems 
with Hamiltonian Yli=i 2m~Pi + k^l'j c ijQiQj where the matrix v is a positive definite 

matrix. This is an isochronous system with frequencies u> = (u>\, . . . whose squares 

_i _i 

are the eigenvalues of the matrix m i 2 Cijirij 2 . It is integrable in the region W of the data 
x = (P, q) E R 2£ such that, setting Ap = ^ ((£- =1 ^=§) 2 + u^TLi ^=?) all 
eigenvectors V/j, (3 = 1, . . . , £, of the above matrix, the vectors A have all components > 

Even though this system is isochronous it, nevertheless, admits a system of canonical 
action-angle coordinates in which the Hamiltonian takes the simplest form 

e 

h(A) = upAp = u> ■ A (6.6) 

(3=1 

v f3,jPi 

with an = — atan =j — 1=1 as conjugate angles. 

An example of anisochronous system is the free rotators or free wheels: i.e. I nonin- 
teracting points on a circle of radius R or £ noninteracting coaxial wheels of radius R. If 
Ji = rriiR 2 or, respectively, Ji = ^rriiR 2 are the inertia moments and if the positions are 
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determined by t angles a = (cti, . . . , at) the angular velocities are constants related to the 
angular momenta A = (Ai, . . . , At) by u>i = A^j ' Jj. The Hamiltonian and the specrum are 



h ( A ) = J2vT A l ^( A ) = (|^). , , (6-7) 

T~7 v Ji / 1=1,.. .,1 

References: [LL68], [Ar68], [Ga83], [Fa98]. 

7. Multidimensional quadratures: central motion 

Important mechanical systems with more than one degree of freedom are integrable 
by canonical quadratures in vast regions of phase space. This is checked by showing 
that there is a foliation into invariant tori T(I) of dimension equal to the number t of 
degrees of freedom parameterized by i constants of motion I in involution, i.e. such that 
{ij, Ij} = 0. One then performs, if possible, the construction of the action-angle variables 
by the quadratures discussed in the previous section. 

The above procedure is well exemplified by the theory of the planar motion of a unit 
mass attracted by a coplanar center of force: the Lagrangian is, in polar coordinates (p, 9) , 
C = tt(/? 2 + p 2 9 2 ) — V(p). The planarity of the motion is not a strong resttriction as 
central motion takes always place on a plane 

Hence the equations of motion are jj^mp 2 9 = 0, i.e. mp 2 9 = G is a constant of motion 

(it is the angular momentum), and p = —d p V(p) + d p ^p 2 '9 2 = —d p V(p) + -^z d — — 
dpVo(p). So that energy conservation yields a second constant of motion E 

^P 2 + \— 2 + V(p) =E=^-p 2 p + ^-4 + V (P) (7-1) 
2 2 rap 1 2m H 2m p z 

The r.h.s. is the Hamiltonian for the system, derived from £, if p p ,pe denote conjugate 
momenta of p, 9: p p = mp and pg = mp 2 9 (note that pg = G). 

Suppose p 2 V(p) p ^ Q > 0: then the singularity at the origin cannot be reached by any 

motion starting with p > if G > 0. Assume also that the function Vq{p) d = + V(p) 

has only one minimum Eq(G), no maximum and no horizontal inflection and tends to a 
limit E ca (G) < oo when p — > oo. Then the system is integrable in the domain W = 
{(p,q)|£?o(G) < E < E^G), G ^ 0}. 

This is checked by introducing a "standard" periodic solution t — > R(t) of mp = 
—dpVoip) with energy Eq{G) < E < E 00 (G) and initial data p = pe,-(G), p = at time 
t = 0, where Pe,±(G) are the two solutions of Vq{p) = E, see Section 5: this is a periodic 

PE,+ (G) dx 



analytic function of t with period T(E, G) = 2 f -.. 

The function R(t) is given, for < t < \T(E, G) or for \T(E, G) < t < T(E, G), by 
the quadratures 

, _ rR(t) dx nr j. _ T(E,G) _ rR(t) dx (7 9 x 

Jp E ,-(G) y/^E-Voix))' OT 2 JpeMG) y/^ { E-Va{x)) ^ ^ 
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respectively. The analytic regularity of R(t) follows from the general existence, uniqueness 
and regularity theorems applied to the differential equation for p. 

Given an initial datum p , Po? #o? #o with energy E and angular momentum G define 
to to be the time such that R(to) = po,R(to) = po: then p(t) = R(t + to) and 9(t) can 
be computed as 9(t) = 9 + J* mR ^ +to ^2 dt': a second quadrature. Therefore we can use 

as coordinates for the motion E, G, to, which determine po, po, 9o and a fourth coordinate 
that determines 9o which could be 9q itself but which is conveniently determined, via the 
second quadrature, as follows. 

The function Gm~ l R(t)~ 2 is periodic with period T(E, G), hence it can be expressed 

in a Fourier series xo 

(E, G) +E fc /o Xk (E, G)e^^ 1 1 ; the quadrature for 9(t) can be per- 
formed by integrating the series terms. Setting 9 (to) d = T( ^ G ' > J2k^o Xk<y ^ G ^ e T ^,G) lt ° k 
and (pi(0) =9 - 9 (to) the 9(t) = 9 + J* mR ^ +to ^ dt' becomes 

<p 1 (t)=<p 1 (0) + X o(E,G)t (7.3) 
Hence the system is integrable and the spectrum is u}(E,G) = (uo(E,G), ui(E,G)) = 
(uo, u>i) with uo d = a) anc ^ Ul d — Xo(E, G), while I = (E, G) are constants of motion 

def def — 

and the angles <p = (cpo, cpi) can be taken ip = uoto, <pi = 9q — 9 (to). At E, G fixed the 
motion takes place on a 2-dimensional torus T(E, G) with <p , <pi as angles. 

In the anisochronous cases, i.e. when det 8e,g uj (E, G) ^ 0, canonical action-angle 
variables conjugated to (p p , p,pe, 9) can be constructed via (6. 2), (6. 3) by using two cycles 
71, 72 on the torus T(E, G). It is convenient to choose 

(1) 71 as the cycle consisting of the points p = x,9 = whose first half (where p p > 0) 
consists in the set pe,-(G) < x < pe,+ (G), p p = ^/2m(E — V G (x)) and d9 = 0; 

(2) 72 as the cycle p = const, 9 G [0, 2ir] on which dp = and pg = G obtaining 

9 rpE,+{G) 

Ai = — ^2m(E-V G (x))dx, A 2 = G (7.4) 

^ JpE,-(G) 

According to the general theory (cf. Sect. 6) a generating function for the canonical 
change of coordinates from (p p , p,pe, 9) to action-angle variables is (if, to fix ideas, p p > 0) 

<-p 



S(A 1 ,A 2 ,p,9) = G9+ j ^2m(E - V G (x))dx (7.5) 

J PE,- 

In terms of the above uo, xo the Jacobian matrix g^'^ is computed from (7. 4), (7. 5) to 
be *° ^ . It follows d E S = t,d G S = 9- 9(t) - X ot so that, see (6.4), 

a x d = d Al S = u t, a 2 = d = d A2 S = 9 - 9(t) (7.6) 
and (Ai,ai), (A 2 ,a 2 ) are the action-angle pairs. 



References: [LL68], [Ga83]. 
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8. Newtonian potential and Kepler's laws 



The anisochrony property, i.e. det 7^ or, equivalently, det g^'g" 7^ 0, is not 

satisfied in the important cases of the harmonic potential and of the Newtonian potential. 
Anisochrony being only a sufficient condition for canonical integr ability it is still possible 
(and true) that, nevertheless, in both cases the canonical transformation generated by 
(7.5) integrates the system. This is expected since the two potentials are limiting cases of 
anisochronous ones {e.g. |q| 2+e and |<3i~ 1_e with e — > 0). 

The Newtonian potential 7i(p,q) = 2^P 2 — ^ is integrable in the region G 7^ 

,2 3 / 1 2 3~ 

0, E Q (G) = -^r <E<Q,\G\< J£$£ . Proceeding as in Section 7 one finds integrat- 
ing coordinates and that the integrable motions develop on ellipses with one focus on the 
center of attraction S so that motions are periodic hence not anisochronous: nevertheless 
the construction of the canonical coordinates via (6. 2), (6. 3), (6. 4) (hence (7.5)), works and 
leads to canonical coordinates (Z/, X' ,G' , 7'). To obtain action-angle variables with a sim- 
ple interpretation it is convenient to perform on the variables (Z/, A', G', 7') (constructed 
by following the procedure just indicated) a further trivial canonical tranformation by 
setting L = V + G' , G = G',X= A', 7 = 7'- A'; then 

A average anomaly, is the time necessary to the point P to move from the pericenter 

to its actual position in units of the period and times 2tv 
L action, is essentially the energy E = — > 
G angular momentum, 

7 axis longitude, is the angle between a fixed axis and the major axis of the ellipse 
oriented from the center of the ellipse O to the center of attraction S. 




Fig. 2: Eccentric and true anomalies of P which moves on a small circle E centered at a point c moving 
on the circle D located half way between the two concentric circles containing the Keplerian ellipse: the 
anomaly of c with respect to the axis OS is £. The circle D is eccentric with respect to S and therefore 
^ is, still today, called eccentric anomaly while the circle D is, in ancient terminology, the deferent circle 
(eccentric circles were introduced in Astronomy by Ptolemy). The small circle E on which the point P 
moves is, in ancient terminology, an epicycle. The deferent and the epicyclical motions are synchronous 
(i.e. have the same period); Kepler discovered that his key o priori hypothesis of inverse proportionality 
between angular velocity on the deferent and distance between P and S {i.e. p£ = constant) implied both 
synchrony and elliptical shape of the orbit, with focus in S. The latter law is equivalent to f?Q = constant 
(because of the identity a£ = p8). Small eccentricity ellipses can be hardly distinguished from circles. 

The eccentricity of the ellipse is e such that G = ±Ly/l — e 2 . The ellipse equation 
is p = a (1 — ecos£) where £ is the eccentric anomaly (see Fig. 2), a — -j^? is the major 
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semiaxis and p is the distance to the center of attraction S. 

Finally the relations between eccentric anomaly £, average anomaly A, true anomaly 
9 (the latter is the polar angle), and SP-distance p are given by Kepler's equations 

A =£ — esin£, (1 — ecos£)(l + ecos6>) = 1 — e 2 , 

'* dff p 1-e 2 (8-1) 



A=(l-e 2 )l f ^ t 

[ Wo (1 + ecos^) 2 ' a 



1 + e cos 

and this relation between true anomaly and average anomaly can be inverted in the form 

£ = \ + gx, e = X + f x =► P - = — 1-62 (8.2) 

a 1 + e cos(A + /a) 

where #a = #(e sin A, e cos A), /a = /(e sin A, e cos A) and g(x,y), f(x,y) are suitable func- 
tions analytic for \x\, \y\ < 1. Furthermore g(x, y) = x (l+y+. . .), f(x, y) = 2x . .) 
and . . . denote terms of degree 2 or higher in x, y, containing only even powers of x. 



References: [LL68], [Ga83]. 



9. Rigid body 



Another fundamental integrable system is the rigid body in absence of gravity and 
with a fixed point O. It can be naturally described in terms of the Euler angles 9o,<fio,i(^o, 
see Fig. 3, and their derivatives #o, ^o, V'o- 



▲ z 

1 3 




Fig. 3: The Euler angles of the comoving frame , i2 , 13 with respect to a fixed frame x,y,z. The 
direction n is the "node line, intersection between the planes x,y and ii,i2- 

Let Ii,l2, h be the three principal inertia moments of the body along the three prin- 
cipal axes with unit vectors ii, 12, 13. The inertia moments and the principal axes are the 
eigenvalues and the associated unit eigenvectors of the 3x3 inertia matrix X which is 
defined by = Yl7=i m i( x i)h( x i)fc' where h, k = 1,2,3 and Xj is the position of the 
i-th particle in a reference frame with origin at O and in which all particles are at rest: 
this comoving frame exists as a consequence of the rigidity constraint. The principal axes 
form a coordinate system which is comoving as well: i.e. also in the frame (O; ii, '12, h) the 
particles are at rest. 
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The Lagrangian is simply the kinetic energy: i.e. we imagine the rigidity constraint to 
be ideal, e.g. realized by internal central forces in the limit of infinite rigidity, as mentioned 
in Sec. 3. The angular velocity of the rigid motion is defined by 



(9.1) 



expressing that a generic infinitesimal motion must consist of a variation of the three Euler 
angles and therefore it has to be a rotation of speeds 07 ip , ip around the axes n, z, i 3 as 
shown in Fig. 3. 

Let (oji , 0J2, be the components of uj along the pricipal axes ii, i2, 13: for brevity 
the latter axes will often be called 1, 2, 3. Then the angular momentum M, with respect 
to the pivot point O, and the kinetic energy K can be checked to be 



K= -(hul + hul + hul) 



(9.2) 



and are constants of motion. From Fig. 3 it follows that ui\ = 9ocosifjo + <fio sin6>o sin-i^o, 
UJ2 = —9q sin ipo + (pQ sin 9q cos i^q and ujs = (po cos 6>o + so that the Lagrangian, unin- 
spiring at first, is 



def 1 

£ = -Ii{0QCOsipo + ^ sin6'osin^o) 2 + 
1 



(9.3) 



+ ^h(-0o sin^o + <po sin/9 cos ^ ) 2 + -h(<Po cos 9 + tp ) 2 

Angular momentum conservation does not imply that the components u)j are constants 
because also ii, i 2 , i3 change with time according to ^ij = u) Aij, j = 1, 2, 3. Hence M = 
becomes, by the first of (9.2) and denoting Iu> = (IiUi, -^3^3), the Euler equations 
Iu> + uj A luj = 0, or 

hull = (I 2 - h)u 2 0O3, I2UJ2 = (h - h)v3Vi- I3U3 = {h - h)uiu 2 (9.4) 
which can be considered together with the conserved quantities, (9.2). 



Miiz 




(4) 



Fig. 4: The laboratory frame, the angular momentum frame, the comoving frame (and the Deprit angles). 

Since angular momentum is conserved it is convenient to introduce the laboratory 
frame (O; x , y~o, z ) with fixed axes x , yo, z and (see Fig. 4): 

(1) (O; x, y, z): the momentum frame with fixed axes, but with z-axis oriented as M, and 
x-axis coinciding with the node (i.e. the intersection) of the plane x yo and the plane 
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xy (orthogonal to M). Therefore x, y, z is determined by the two Euler angles C,7 of 
(O; x, y, z) in (O; x , y , z ), 

(2) (0;1,2,3): the comoving frame, i.e. the frame fixed with the body, and with unit 
vectors ii , i2, 13 parallel to the principal axes of the body. The frame is determined by 
three Euler angles 6>o, ifo, 4>o 

(3) The Euler angles of (O; 1, 2, 3) with respect to (O; x, y, z) which are denoted 6>, </?, ip 

(4) G: the total angular momentum: G 2 = J2j l]^ 2 

(5) M3: the angular momentum along the zo axis; M 3 = Gcos£ 

(6) L: the projection of M on the axis 3, L = G cos 6 



The quantities G, M 3 , L, (p, 7, ip determine 6> , <Po, ipo and (9 , <£>o, V>o, or the pe , p Vo , p^ 
variables conjugated to $o, ipo, tpo as shown by the following comment. 

Considering Fig. 4, the angles £, 7 determine location, in the fixed frame (O; xo, yo, zo) 
of the direction of M and the node line m, which are respectively the z-axis and the x 
axis of the fixed frame associated with the angular momentum; the angles 9, ip, tfj then 
determine the position of the comoving frame with respect to the fixed frame (O; x, y, z), 
hence its position with respect to (O; x , yo, z ), i.e. (0o>V?o>V'o)- From this and G it is 
possible to determine u> because 

Q I3U3 , hU2 2 7— 2/^y2 t2 2 r 2 2\ / n r\ 

cos# = tan-0 = j-jj- , uj 2 =I 2 (G - - I 3 uj 3 ) (9.5) 

and, from (9.1), #0 , ^0 , V'o are determined. 

The Lagrangian (9.3) gives immediately (after expressing u>, i.e. n, z, i 3 , in terms of 
the Euler angles 6>o, <fo, V'o) an expression for the variables pe ,Pip -, Pi>o conjugated to 

#0, <Po, V'o 

Pe = M • n p m = M • z p^o = M i 3 (9.6) 

and in principle we could proceed to compute the Hamiltonian. 

However the computation can be avoided because of the very remarkable property 
(Deprit), which can be checked with some patience, making use of (9.6) and of elementary 
spherical trigonometry identities, 

M3CZ7 + Gdtp + Ld%l) = p<p d(po + Prp di>o + pe o d0 (9.7) 

which means that the map ((M 3 , 7), (L, ip), (G, <p))< — > {{pe , #o), {p Vo , <Po), (Ptp , V'o)) is a 
canonical map. And in the new coordinates the kinetic energy, hence the Hamiltonian, 
takes the form 



*-3 



£ + (G »_W!!£* + £5£*)l (9.8) 

i 3 V li 1 2 /J 



This again shows that G, M3 are constants of motion, and the L, ^ variables are deter- 
mined by a quadrature, because the Hamilton equation for if; combined with the energy 
conservation yields 



3//ebbraio/2008; 13:37 



19 



j nT p ^i2( sin2 i> I cos 2 1/1 ^ 

^ = ±(i - ^ - ^) J : izrtizr 

V 7 3 'l '2 

In the integrability region this motion is periodic with some period Tl(E, G). Once ip(t) 
is determined the Hamilton equation for <p leads to the further quadrature 

, = sin^( t ) + cos^W 

which determines a second periodic motion with period Tg(E, G). The 7, M 3 are constants 
and, therefore, the motion takes place on three dimensional invariant tori Te,g,m z in phase 
space each of which is always foliated into two dimensional invariant tori parameterized 
by the angle 7 which is constant by (9.8) (because K is M 3 -independent): the latter are 
in turn foliated by one dimensional invariant tori, i.e. by periodic orbits, with E, G such 
that the value of T L (E, G)/T G {E, G) is rational. 

Note that if I\ = I2 = I the above analysis is extremely simplified. Furthermore 
if gravity g acts on the system the Hamiltonian will simply change by the addition of a 
potential —mgz if z is the height of the center of mass. Then, see Fig. 4, if the center 
of mass of the body is on the axis '13 and z = h cos 6>o and h is the distance of the center 
of mass from O, since cos6> = cos6>cosC — sin 9 sin ( cos (p, the Hamiltonian will become 
H = K — mgh cos 6> or 



n 



^ + ^-mgh(^-(l- #) 1/2 (1 - g) 1/2 cosv,j (9.11) 



21; 



so that, again, the system is integrable by quadratures (with the roles of ifj and <p "in- 
terchanged" with respect to the previous case) in suitable regions of phase space. This is 
called the Lagrange's gyroscope. 

A less elementary integrable case is when the inertia moments are related as I± = I2 = 
2/3 and the center of mass is in the plane ii, 12 (rather than on the i3 axis) and only gravity 
acts, besides the constraint force on the pivot point O; this is called the Kowalevskaia's 
gyroscope. 



References: [Ga83]. 



10. Other quadratures 



An interesting classical integrable motion is that of a point mass attracted by two 
equal mass centers of gravitational attraction. Or a point ideally constrained to move on 
the surface of a general ellipsoid. 

New integrable systems have been discovered quite recently and have generated a 
wealth of new developments ranging from group theory (as integrable systems are closely 
related to symmetries) to partial differential equations. 
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It is convenient to extend the notion of integrability by saying that a system is inte- 
grable in a region W of phase space if 



(1) there is a change of coordinates (p, q) G W < — >{A, a, Y, y} G (U x T l ) x (V x E m ) 
where U C R*, R m , with £ + m > 1, are open sets and 

(2) the A,Y are constants of motion while the other coordinates vary "linearly": 

(a, y) -> (a + w(A, Y)f , y + v(A, Y)f) (10.1) 
where u>(A, Y), v(A, Y) are smooth functions. 



In the new sense the systems studied in the previous sections are integrable in much 
wider regions (essentially on the entire phase space with the exception of a set of data which 
lie on lower dimensional surfaces forming sets of zero volume). The notion is convenient 
also because it allows us to say that even the systems of free particles are integrable. 

Two very remarkable systems integrable in the new sense are the Hamiltonian systems, 
respectively called Toda lattice (Kruskal, Zabusky), and Calogero lattice, (Calogero, 
MOSER); if (pi, qi) G M 2 they are 

1 n n— 1 

^(p,q)=^E^ + E^"^ +1 " 9l) 

1=1 1=1 (10.2) 

^ n n 1 71 

where m > and k,uj, g > 0. They describe the motion of n interacting particles on a line. 

The integration method for the above systems is again to find first the constants of 
motion and later to look for quadratures, when appropriate. The constants of motion can 
be found with the method of the Lax pairs. One shows that there is a pair of self adjoint 
n x n matrices M(p, q), iV(p, q) such that the equations of motion become 

|M(p,q) = z[M(p,q),iV(p,q)], i = ^1 (10.3) 

which imply that M{t) = L r (t)M(0)L r (t) _1 , with U(t) a unitary matrix. When the equa- 
tions can be written in the above form it is clear that the n eigenvalues of the matrix 
M(0) = M(po, qo) are constants of motion. When appropriate, e.g. in the Calogero lat- 
tice case with u> > 0, it is possible to proceed to find canonical action-angle coordinates: 
a task that is quite difficult due to the arbitrariness of n, but which is possible. 
The Lax pairs for the Calogero lattice (with uj = 0, g = m = 1) are 



M hh = p h , N hh = 0, and M hk = -, N hk = ^ h^k (10.4) 

[Qh - Qk) {Qh - Qk) 

while for the Toda lattice (with m = g = = 1) the non zero matrix elements of M, N 
are 
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M hh =p h , M hih+1 = M h+hh = e -(9ft-«fc+i), 

(10.5) 

N hth+1 = -N h+ljh = i e -(«.-^+i) 

which are checked by first trying the case n = 2. 
Another integrable system (Sutherland) is 

1 n n 

whose Lax pair is related to that of the Calogero lattice. 

By taking suitable limits as n — > oo and as the other parameters tend to or oo 
at suitable rates integrability of a few differential equations, among which the Korteweg- 
deVries equation or the nonlinear Schodinger equation, can be derived. 



As mentioned in Section 1 symmetry properties under continuous groups imply exis- 
tence of constants of motion. Hence it is natural to think that integrability of a mechanical 
system reflects enough symmetry to imply the existence of as many constants of motion, 
independent and in involution, as the number n of degrees of freedom. 

This is in fact always true, and in some respects it is a tautological statement in the 
anisochronous cases. Integrability in a region W implies existence of canonical action- 
angle coordinates (A, a), see Section 6, and the Hamiltonian depends solely on the A's: 
therefore its restriction to W is invariant with respect to the action of the continuous 
commutative group T n of the translations of the angle variables. The actions can be seen 
as constants of motion whose existence follows from Noether's theorem, at least in the 
anisochronous cases in which the Hamiltonian formulation is equivalent to a Lagrangian 
formulation. 

What is nontrivial is to recognize, prior to realizing integrability, that a system ad- 
mits this kind of symmetry: in most of the interesting cases the systems either do not 
exhibit obvious symmetries or they exhibit symmetries apparently unrelated to the group 
T n , which nevertheless imply existence of constants of motion in number sufficient for 
integrability. Hence nontrivial integrable systems possess a "hidden" symmetry under T n : 
the rigid body is an example. 



However very often the symmetries of a Hamiltonian H which imply integrability also 
imply partial isochrony, i.e. imply that the number of independent frequencies is smaller 
than n, see Section 6. Even in such cases often a map exists from the original coordinates 
(p, q) to the integrating variables (A, a) in which A are constants of motion and the 
ot are uniformly rotating angles (some of which are also constant) with spectrum <*>(A) 
which is the gradient d A.h(A) for some function h( A) depending only on a few of the A 
coordinates. However the map might fail to be canonical. The system is then said to be 
bihamiltonian: in the sense that one can represent motions in two systems of canonical 
coordinates, not related by a canonical transformation, and by two Hamiltonian functions 
H and H' = h which generate the same motions in the respective coordinates (the latter 
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changes of variables are sometimes called "canonical with respect to the pair H, H'" while 
the transformations considered in Section 4 are called completely canonical) . 

References: [CD82]. 

11. Generic nonintegrability 

It is natural to try to prove that a system "close" to an integrable one has motions 
with properties very close to quasi-periodic. This is indeed the case but in a rather subtle 
way. That there is a problem is easily seen in the case of a perturbation of an anisochronous 
integrable system. 

Assume that a system is integrable in a region W of phase space which, in the inte- 
grating action-angle variables (A, a), has the standard form U x T £ with a Hamiltonian 
h(A) with gradient w(A) = dxh(A). If the forces are perturbed by a potential which is 
smooth then the new system will be described, in the same coordinates, by a Hamiltonian 
like 

H £ (A,a) = h(A)+ef(A,a) (11.1) 

with h, f analytic in the variables A, a. 

If the system really behaved like the unperturbed one it ought to have I constants of 
motion of the form F £ (A, a) analytic in e near e = and uniform, i.e. single valued (which 
is the same as periodic) in the variables a. However the following theorem (Poincare) 
shows that this is somewhat unlikely a possibility. 

// the matrix d\ A h(A) has rank > 2 the Hamiltonian (11.1) "generically" (an intuitive 
notion precised below) cannot be integrated by a canonical transformation C £ (A, et) which 

(1) reduces to the identity as e — > 0, 

(2) is analytic in e near e = and in (A, a) e U' x T^, with U' C U open. 
Furthermore no uniform constants of motion F £ (A, cx), defined for e near and (A, at) in 
an open domain U' x T^, exist other than the functions of H £ itself. 

Integrability in the sense (1),(2) can be called analytic integrability and it is the 
strongest (and most naive) sense that can be given to the attribute. 

The first part of the theorem, i.e. (1),(2), holds simply because, if integrability was 
assumed, a generating function of the integrating map would have the form A' ■ ct + 
$ e (A', a) with $ admitting a power series expansion in e as $ e = eQ 1 +e 2 $ 2 + . . .. Hence 
$ x would have to satisfy 

«(A0 • d a ^(A', a ) + f(A\ a) = J(A') (11.2) 

where f(A') depends only on A' (hence integrating both sides with da it appears that 
f(A') must coincide with the average of /(A', a) over a). 
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This implies that the Fourier transform f u (A), i/gZ, should satisfy 

/„(A') =0 if w(A') • 1/ = 0, i/ ^ (11.3) 

which is equivalent to the existence of f u {A') such that f u {A) = ^(A') ■i/f l ,(A) for i/ 7^ 0. 
But since there is no relation between ti>(A) and /(A, a) this property "generically" will 
not hold in the sense that as close as wished to a / which satisfies the property (11.3) there 
will be another / which does not satisfy it esentially no matter how "closeness" is defined, 
e.g. with respect to the metric ||/ — g\\ = J2„ l/f(A) — <7„(A)||. This is so because the 
rank of d AA h(A) is higher than 1 and a? (A) varies at least on a two dimensional surface, 
so that uj ■ v = becomes certainly possible for some 1/ ^ while fv{A) in general will 
not vanish, so that $ 1 , hence $ e , does not exist. 

This means that close to a function / there is a function /' which violates (11.3) for 
some v. Of course this depends on what is meant by "close" : however here essentially any 
topology introduced on the space of the functions / will make the statement correct. For 
instance if the distance between two functions is defined by ^ u sup Ae(7 |/„(A) — g u (A)\ 
or by sup A>a |/(A,a) -g(A,a)\. 

The idea behind the last of the theorem statement is in essence the same: consider, 
for simplicity, the anisochronous case in which the matrix d AA h(A) has maximal rank £, 
i.e. the determinant det d AA h(A) does not vanish. Anisochrony implies that u(A) v 7^ 
for all v 7^ and for A on a dense set and this property will be used repeatedly in the 
following analysis. 

Let B{e,A,a) be a "uniform" constant of motion, meaning that it is single-valued 
and analytic in the non simply connected region U x T £ and, for e small, 

B(e, A, a) = B (A, a) + eB 1 (A, a) + e 2 B 2 (A, a) + . . . (11.4) 

The condition that B is a constant of motion can be written order by order in its expansion 
in s: the first two orders are 

w(A) • d a B (A, a) = 0, 

<9 A /(A, a) • d a B (A, a) - d a f(A, a) • d A B (A, a)+ (11.5) 
+ oj(A)-d a B 1 (A,a) = 

Then the above two relations and anisochrony imply that Bq must be a function of A 
only and the second that w(A) • v and 8a Bo (A) • v vanish simoultaneously for all v. 
Hence the gradient of So must be proportional to u}(A), i.e. to the gradient of h(A): 
d aBq(A) = X(A)d a^(A). Therefore generically (because of the anisochrony) it must be 
that So depends on A through h(A): Bq(A) = F(h(A)) for some F. 

Looking again, with the new information, at the second of (11.5) it follows that at fixed 
A the a-derivative in the direction u>(A) of Bi equals F'(h(A)) times the a-derivative of 
/, i.e. B 1 (A, a) = /(A, a) F'(h(A)) + C 1 (A). 

Summarizing: the constant of motion B has been written as B(A, at) = F(h(A)) + 
eF'(h(A)) f(A,a)+eC 1 (A)+e 2 B 2 + . . . which is equivalent to B(A,a) = F(H e )+e(B' + 
eB[ + . . .) and therefore B' + eB[ + ... is another analytic constant of motion. Repeating 
the argument also B' +eB' 1 + . . . must have the form Fi(H £ )+e (Bq+eB" + . . .); conclusion 
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B = F{H £ ) + e Fi(H e ) + e 2 F 2 (H £ ) + ... + e n F n (H £ ) + 0(e n+1 ) (11.6) 

By analyticity B = F £ (H £ (A : ct)) for some F £ : hence generically all constants of motion 
are trivial. 

Therefore a system close to integrable cannot behave as it would naively be expected. 
The problem, however, was not manifest until Poincare's proof of the above results: 
because in most applications the function / has only finitely many Fourier components, 
or at least is replaced by an approximation with this property, so that at least (11.3) and 
even a few of the higher order constraints like (11.5) become possible in open regions of 
action space. In fact it may happen that the values of A of interest are restricted so that 
a; (A) • v = only for "large" values of v and there /„ = 0. Nevertheless the property 
that f„(A) = (u>(A) • u) f u (A) (or the analogous higher order conditions, e.g. (11.5)), 
which we have seen to be necessary for analytic integrability of the perturbed system, can 
be checked to fail in important problems, if no approximation is made on /. Hence a 
conceptual problem arises. 

References: [Po]. 

12. Perturbing functions 

To check, in a given problem, the nonexistence of nontrivial constants of motion along 
the lines indicated in Sect. 11 it is necessary to express the potential, usually given in 
Cartesian coordinates as eV(x), in terms of the action-angle variables of the unperturbed, 
integrable, system. 

In particular the problem arises when trying to check nonexistence of nontrivial con- 
stants of motion when the anisochrony assumption (cf. Sect. 11) is not satisfied. Usually 
it becomes satisfied "to second order" (or higher) : but to show this a more detailed infor- 
mation on the structure of the perturbing function expressed in action-angle variables is 
needed. For instance this is often necessary even when the perturbation is approximated 
by a trigonometric polynomial, as it is essentially always the case in Celestial Mechanics. 

Finding explicit expressions for the action-angle variables is in itself a rather nontrivial 
task which leads to many problems of intrinsic interest even in seemingly simple cases. 
For instance, in the case of the planar gravitational central motion, the Kepler equation 
A = e sin£, see the first of (8.1), must be solved expressing £ in terms of A, see the first of 
(8.2). It is obvious that for small e the variable £ can be expressed as an analytic function 
of s: nevertheless the actual construction of this expression leads to several problems. For 
small e an interesting algorithm is the following. 

Let h{\) = £ — A so that the equation to solve (i.e. the first of (8.1)) is 

Be 

h{\) = esin(A + h(X)) = -e —(A + h{\)) (12.1) 

uX 

where c(A) = cos A; the function A — > h(X) should be periodic in A, with period 27r, and 
analytic in e, A for e small and A real. If h(X) = eh^ + e 2 h^ + . . . the Fourier transform 
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of h( k \\) satisfies the recursion relation 



^ ) = -El E i^)c V0 {i^Yj[h^\ k>l (12.2) 

p =i P~ k 1 +...+k p =k-i 

with c v the Fourier transform of the cosine (c±\ = |, c v = if z/ 7^ ±1), and (of course) 

—ivc v . Eq. (12.2) is obtained by expanding the r.h.s. of (12.1) in powers of h and 
then taking the Fourier transform of both sides retaining only terms of order k in e. 

Iterating the above relation imagine to draw all trees 9 with k "branches" , or "lines" , 
distinguished by a label taking k values, and k nodes and attach to each node v a harmonic 
label v v = ±1 as in Fig. 5. The trees will assumed to start with a root line vr linking a 
point r and the "first node" v, see Fig. 5, and then bifurcate arbitrarily (such trees are 
sometimes called "rooted trees"). 

V 4 
"5 

(5) 

Fig. 5: An example of a tree graph and of its labels. It contains only one simple node (l). Harmonics 
are indicated next to their nodes. Labels distinguishing lines are not marked. 

Imagine the tree oriented from the endpoints towards the root r (not to be considered 
a node) and given a node v call v' the node immediately following it. If v is the first node 
before the root r let v' = r and iv = 1. For each such decorated tree define its numerical 
value 

Vai ^ = n n c ^ ( i2 - 3 ) 

lines l=v'v nodes 

and define a current v(l) on a line / = v'v to be the sum of the harmonics of the nodes 
preceding v'\ v(l) = J2 w <v v v- Call v(Q) the current flowing in the root branch and call 
order of 9 the number of nodes (or branches). Then 

h^= Yl Val ^) ( 12 - 4 ) 

order(6) = k 

provided trees are considered identical if they can be overlapped (labels included) after 
suitably scaling the lengths of their branches and pivoting them around the nodes out of 
which they emerge (the root is always imagined fixed at the origin). 

If the trees are stripped of the harmonic labels their number is finite and it can be 
estimated < k\4 k (because the labels which distinguish the lines can be attached to an 
unlabeled tree in many ways). The harmonic labels (i.e. v v = ±1) can be laid down in 2 k 
ways, and the value of each tree can be bounded by jj2~ k (because c±i = \). 
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Hence J2 U \ fi^\ < A k , which gives a (rough) estimate of the radius of convergence 
of the expansion of h in powers of e: namely .25 (easily improvable to 0.3678 if A k k\ is 
replaced by k k ~ 1 using Cayley's formula for the enumeration of rooted trees). A simple 
expression for h^ k \t(j) (Lagrange) is h^ k \%l)) = ^jd^ -1 sin fc ip (also readable from the tree 
representation): the actual radius of convergence, first determined by Laplace, of the 
series for h can be also determined from the latter expression for h (Rouche) or directly 
from the tree representation: it is ~ .6627. 

One can find better estimates or at least more efficient methods for evaluating the 
sums in (12.4): in fact in performing the sum in (12.4) important cancellations occurr. For 
instance the harmonic labels can be subject to the further strong constraint that no line 
carries zero current because the sum of the values of the trees of fixed order and with at 
least one line carrying current vanishes. 

The above expansion can also be simplified by "partial resummations" . For the pur- 
pose of an example, call simple the nodes with one entering and one exiting line (see Fig. 
5). Then all tree graphs which on any line between two non simple nodes contain any 
number of simple nodes can be eliminated. This is done by replacing, in evaluating the 
(remaining) trees value, the factors v v iv v in (12.3) by u v /u v /(l —ecosifj): then the value 
of a tree 9 becomes a function of ip and e to be denoted Val(6>)^ and (12.4) is replaced by 

oo 

h(,f;) = J2 Ys* £ fc e l ^Val(6% (12.5) 

k=l e,v(e)=v 

order (0) = k 

where the * means that the trees are subject to the further restriction of not containing any 
simple node. It should be noted that the above graphical representation of the solution of 
the Kepler equation is strongly reminiscent of the representations of quantities in terms of 
graphs that occurr often in quantum field theory. Here the trees correspond to 'Feynman 
graphs", the factors associated with the nodes are the "couplings", the factors associated 
with the lines are the "propagators" and the resummations are analogous to the "self- 
energy resummations", while the mentioned cancellations can be related to the class of 
identities called "Ward identities" . Not only the analogy can be shown to be not superficial, 
but it turns out to be even very helpful in key mechanical problems: see Appendix Al. 

The existence of a vast number of identities relating the tree values is shown already 
by the "simple" form of Lagrange's series and by the even more remarkable resummation 
( Levi- C i vita) leading to 

k=l Y 

It is even possible further collection of the series terms to express it as series with 
much better convergence properties, for instance its terms can be reorganized and collected 
(resummed) so that h is expressed as a power series in the parameter 

V = / r , W (12-7) 



l + VT 



e 
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with radius of convergence 1, which corresponds to e = 1 (via a simple argument by Levi- 
Civita). The analyticity domain for the Lagrange series is \rj\ < 1. This also determines 
the Laplace radius value: it is the point closest to the origin of the complex curve \rj(e) | = 1: 
it is imaginary so that it is the root of the equation ee^ 1+e2 / (1 + VT+ £ 2 ) = 1- 

The analysis provides an example, in a simple case of great interest in applications, of 
the kind of computations actually necessary to represent the perturbing function in terms 
of action-angle variables. The property that the function c(A) in (12.1) is the cosine has 
been used only to limit the range of the label v to be ±1; hence the same method, with 
similar results, can be applied to study the inversion of the relation between the average 
anomaly A and the true anomaly 6 and to get, for instance, quickly the properties of /, g 
in (8.2). 

References: [LC]. 

13. Lindstedt and Birkhoff series. Divergences. 

Nonexistence of constants of motion, rather than being the end of the attempts to 
study by perturbation methods motions close to integrable ones, marks the beginning of 
renewed efforts to understand their nature. 

Let (A, a.) G U x T l be action-angle variables defined in the integrability region for an 
analytic Hamiltonian and let h(A) be its value in the action-angle coordinates. Suppose 
that h(A) is anisochronous and let /(A, at) be an analytic perturbing function. Consider, 
for e small, the Hamiltonian 7i e (A, a.) = H,q(A) + ef(A, a). 

Let u>o = uj(Aq) = ^a^o(A) be the frequency spectrum, see Section 6, of one of 
the invariant tori of the unperturbed system corresponding to an action A . Short of 
integrability the question to ask at this point is whether the perturbed system admits an 
analytic invariant torus on which motion is quasi periodic with 

(1) the same spectrum ujq and 

(2) depends analytically on e at least for e small and 

(3) reduces to the "unperturbed torus" {A } x as e — > 0. 

More concretely the question is 

Are there functions H e (i/>), h £ (t/>) analytic in if) G and in e near 0, vanishing as 
e — > and such that the torus with parametric equations 

a = a o + h £ (t/0, c* = v + h e O0) i>eY (13.1) 

def 

is invariant and, if u)q = u>(Ao), the motion on it is simply if) — > if) + u)ot, i.e. it is quasi 
periodic with spectum ujq ? 

In this context Poincare's theorem of Sect. 11 had followed another key result, earlier 
developed in particular cases and completed by him, which provides a partial answer to 
the question. 
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Suppose that w = w(A ) G M satisfies a Diophantine property, namely suppose that 
there exist constants C, r > such that 

\uo-»\>tJ l ^, for all ^ v e Z £ (13.2) 
C|i/| T 

which, for each r > i — 1 /ixerf, is a property enjoyed by all w e 6m£ /or a set o/ zero 
measure. Then the motions on the unperturbed torus run over trajectories that fill the 
torus densely because of the "irrationality" of wo implied by (13.2). Writing Hamilton's 
equations,, ex = dpJ-Lo(A) + ed Af(A, ex), A = —ed OL f{A,cx) with A, ex given by (13.1) 
with if) replaced by t/j+ujt, and using the density of the unperturbed trajectories implied by 
(13.2), the condition that (13.1) are equations for an invariant torus on which the motion 
is tjj — > t/> + u>ot are 

w + (w -d^)h £ (i>) = d A H (A + H e (</0) + ed A f(A + H e (t/>), ^ + h e (^)) 
(w • £MH e (V) = -efl «/(A + H e (^), ^ + h e (^)) 
The theorem referred above (Poincare) is that 

If the unperturbed system is anisochronous and w = w(A ) satisfies (13.2) for some 
C, t > t/iere exist two well defined power series 

h £ W = Er=i £k h^(V0 and H £ (V) = 
E^ 1 £ fc H( fc )(^) which solve (13.3) to all orders in e. The series for H e is uniquely 
determined, and such is also the series for h e up to the addition of an arbitrary constant 
at each order, so that it is unique if h £ is required, as henceforth done with no loss of 
generality, to have zero average over if) . 

The algorithm for the construction is illustrated in a simple case in Section 14, 
(14. 4), (14. 5). Convergence of the above series, called Lindstedt series, even for e small 
has been a problem for a rather long time. Poincare proved existence of the formal 
solution; but his other result, discussed in Sect.ll, casts doubts on convergence although 
it does not exclude it as was immediately stressed by several authors (including Poincare 
himself). The result of Sect. 11 shows impossibility of solving (13.3) for all wo's near 
a given spectrum, analytically and uniformly, but it does not exclude the possibility of 
solving it for a single ujq. 

The theorem admits several extensions or analogues: an interesting one is to the case 
of isochronous unperturbed systems: 

Given the Hamiltonian H s (A,cx) = wo • A + ef(A,cx), with ujq satisfying (13.2) and f 
analytic, there exist power series C £ (A', ex'), u e (A') such that Ti £ (C s (A', ex')) = wo ■ A' + 
it e (A') holds as an equality between formal power series (i.e. order by order in e) and at 
the same time the C s regarded as a map satisfies order by order the condition (i.e. (4. 3) ) 
that it is a canonical map. 

This means that there is a generating function A' ■ ex + $ e (A', ex.) also defined by a 
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formal power series $ e (A', a) = J2kLi ek ®^{A', ct), i.e. such that if C £ (A', ct') = (A, ct) 
then it is true, order by order in powers of e, that A = A' + d a $ e (A' , ct) and ct' = 
ct + 9a'^(A-', ct). The series for $ e , u £ are called Birkhoff series. 

In this isochronous case if Birkhoff series were convergent for small £ and (A', a) in 
a region of the form U x T^, with £/" C M 1 open and bounded, it would follow that, for 
small s, H e would be integrable in a large region of phase space (i.e. where the generating 
function can be used to build a canonical map: this would essentially be U xT l deprived 
of a small layer of points near the boundary of U). However convergence for e small is 
false (in general) as shown by the simple two dimensional example 

H £ (A,a) =u> - A + e(A 2 + /(«)), (A, ct) e R 2 x T 2 (13.4) 

with f(ct) an arbitrary analytic function with all Fourier coefficients f u positive for v 7^ 
and fo = 0. In the latter case the solution is 



u £ (A')=eA 2 , $ £ ( A ',a) = $> fc £ f v j<*»- ^ -— - (13.5) 

fcl (z(^oi^i+^o2^)) fc+1 

The series does not converge: in fact its convergence would imply integrability and, conse- 
quently, bounded trajectories in phase space: however the equations of motion for (13.4) 
can be easily solved explicitly and in any open region near a given initial data there are 
other data which have unbounded trajectories if ^01/(^02 + e) is rational. 

Nevertheless even in this elementary case a formal sum of the series yields 

u{A') = eA' 2 , * e (A',a) = e £ ^f*" , , (13-6) 

H^oi^i + (^20 + e)u 2 ) 

and the series in (13.6) (no longer a power series in e) is really convergent if u> = (u>oi, ^02 + 
e) is a Diophantine vector (by (13.2), because analyticity implies exponential decay oi\f u \). 
Remarkably for such values of e the Hamiltonian H e is integrable and it is integrated by 
the canonical map generated by (13.6), in spite of the fact that (13.6) is obtained, from 
(13.5), via the non rigorous sum rule 

00 1 

J2 zk = for 2^1 (13.7) 

k=0 

(applied to cases with \z\ > 1, which are certainly realized for a dense set of e's even if uj is 
Diophantine because the z's have values z = -^^)- In other words the integration of the 
equations is elementary and once performed it becomes apparent that, if uj is diophantine, 
the solutions can be rigorously found from (13.6). Note that, for instance, this means that 
relations like YlT=o^ h = ~^ are reau y used to obtain (13.6) from (13.5). 

Another extension of Lindstedt series arises in a perturbation of an anisochronous 
system when asking the question of what happens to the unperturbed invariant tori T Uo 
on which the spectrum is resonant, i.e. u>o ■ v = for some v 7^ 0, u e 7L l . The result is 
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that even in such case there is a formal power series solutions showing that at least a few 
of the (infinitely many) invariant tori into which %j is in turn foliated in the unperturbed 
case can be formally continued at e 7^ 0, see Section 15. 

References: [Po]. 

14. Quasi periodicity and KAM stability 

To discuss more advanced results it is convenient to restrict attention to a special (non 
trivial) paradigmatic case 

H £ (A,<x) = ^A 2 + ef(a) (14.1) 

In this simple case (called Thirring model: representing I particles on a circle interacting 
via a potential ef(ac)) the equations for the maximal tori (13.3) reduce to equations for 
the only functions h e : 

(w • d ^) 2 h e (^) = -ed cJ(V + M</0), eT £ (14.2) 

as the second of (13.3) simply becomes the definition of H g because the r.h.s. does not 
involve H £ . 

The real problem is therefore whether the formal series considered in Section 13 con- 
verge at least for small e: and the example (13.4) on the Birkhoff series shows that some- 
times sum rules might be needed in order to give a meaning to the series. In fact whenever 
a problem (of physical interest) admits a formal power series solution which is not conver- 
gent, or which it is not known whether it is convergent, then one should look for sum rules 
for it. 

The modern theory of perturbations starts with the proof of the convergence for s 
small enough of the Lindstedt series (Kolmogorov). The general "KAM" result is 

Consider the Hamiltonian H e (A,ct) = h(A) + ef(A,ct), defined in U = V x T £ with 
V C M. e open and bounded and with f(A,ct),h(A) analytic in the closure V x T £ where 

def 

h(A) is also anisochronous; let ujq = u}{Aq) = <9a^(Ao) and assume that ujq satisfies 
(13.2). Then 

(1) there is ec, T > such that the Lindstedt series converges for \e\ < £c,t> 

(2) its sum yields two function H e (?/>), h e (t/>) on T which parameterize an invariant torus 
T c , T (A ,e), 

(3) on Tc, T (Ao, e) the motion is 1J) — > t/> + u>ot, see (13.1). 

(4) the set of data in U which belong to invariant tori Tc jT (Ao,e) with u>(Ao) satisfying 
(13.2) with prefixed C,t has complement with volume < const C~ a for a suitable a > 
and with area also < const C~ a on each nontrivial surface of constant energy H e = E. 

In other words for e small the spectra of most unperturbed quasi periodic motions 
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can still be found as spectra of perturbed quasi periodic motions developing on tori which 
are close to the corresponding unperturbed ones (i.e. with the same spectrum). 

This is a stability result: for instance in systems with two degrees of freedom the 
invariant tori of dimension 2 which lie on a given energy surface, which has 3 dimensions, 
will separate the points on the energy surface into the set which is "inside" the torus and 
the set which is "outside" : hence an initial datum starting (say) inside cannot reach the 
outside. Likewise a point starting between two tori has to stay in between forever: and if 
the two tori are close this means that motion will stay very localized in action space, with 
a trajectory accessing only points close to the tori and coming close to all such points, 
within a distance of the order of the distance between the confining tori. The case of three 
or more degrees of freedom is quite different, see Sect. 17, 19. 

In the simple case of the rotators system (14.1) the equations for the parametric 
representation of the tori are the (14.2). The latter bear some analogy with the easier 
problem in (12.1): but the (14.2) are I equations instead of one and they are differential 
equations rather than ordinary equations. Furthermore the function /(a) which plays here 
the role of c(A) in (12.1) has Fourier coefficients f u with no restrictions on i/, while the 
Fourier coefficiens c v for c in (12.1) do not vanish only for v = ±1. 

The above differences are, to some extent, "minor" and the power series solution to 
(14.2) can be constructed by the same algorithm used in the case of (12.1): namely one 
forms trees as in Fig. 5 with the harmonic labels v v G Z replaced by u v G (still to 
be thought of as possible harmonic indices in the Fourier expansion of the perturbing 
function /). All other labels affixed to the trees in Sec. 11 will be the same. In particular 
the current flowing on a branch I = v'v will be defined as the sum of the harmonics of the 
nodes w < v preceding v: 



u(l) = f ( 14 - 3 ) 

w<v 



and we call v(9) the current flowing in the root branch. 

This time the value Val(#) of a tree has to be defined differently because the equation 
(14.2) to be solved contains the differential operator (c^o • d ^) 2 which in Fourier transform 
becomes multiplication of the Fourier component with harmonic v by (iuj ■ v) 2 . 

The variation due to the presence of the operator (u> ■ d and the necessity of its 
inversion in the evaluation of u- hi, , i.e. of the component of h„ along an arbitrary unit 
vector u, is nevertheless quite simple: the value of a tree graph 9 of order k (i.e. with k 
nodes and k branches) has to be defined by (cf. (12.3)) 



v.w n f^)( n u) a") 

lines 1= v'v y^O ^v')) nodes v 



where the v v > appearing in the factor relative to the root line rv from the first node v to 
the root r, see Fig. 5, is interpreted as an unit vector u (it was interpreted as 1 in the "one 
dimensional" case (12.1)). The (14.4) makes sense only for trees in which no line carries 
current. Then the component along u (the harmonic label attached to the root of a tree) 
of h( fc ) is given, see also (12.4), by 
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u • h?) = Yl Val ( 6 ') ( 14 - 5 ) 

e, v{d)=v> 

order(0) —k 

where the * means that the sum is only over trees in which a non zero current u(l) flows 
on the lines / G 9. The quantity u ■ hg will be 0, see Section 13. 

In the case of (12.1) zero current lines could appear: but the contributions from 
tree graphs containing at least one zero current line would cancel. In the present case 
the statement that the above algorithm actually gives by simply ignoring trees with 
lines with current is non trivial. It has been Poincare's contribution to the theory 
of Lindstedt series to show that even in the general case, cf.(13.3), the equations for the 
invariant tori can be solved by a formal power series. The (14.5) is proved by induction 
on k after checking it for the first few orders. 

The algorithm just described leading to (14.4) can be extended to the case of the 
general Hamiltonian considered in the KAM theorem. 

The convergence proof is more delicate than the (elementary) one for the equation 
(12.1). In fact the values of trees of order k can give large contributions to h^: because 

the "new" factors (ljq • ^(0) > although not zero, can be quite small and their small size 
can overwhelm the smallness of the factors f v and e: in fact even if / is a trigonometric 
polynomial (so that f u vanishes identically for \u\ large enough) the currents flowing in the 
branches can be very large, of the order of the number k of nodes in the tree, see (14.3). 

This is called the small divisors problem. The key to its solution goes back to a related 
work (Siegel) which shows that 

Consider the contribution to the sum in (14-3) from graphs 9 in which no pairs of lines 
which lie on the same path to the root carry the same current and, furthermore, the node 
harmonics are bounded by \v\ < N for some N . Then the number of lines I in 9 with 
divisor u>q ■ ui satisfying 2~ n < C |u>o • ve\ < 2~ n+1 does not exceed 4 N k2~ n / T . 

Hence, setting F d = C 2 max.\„\<N the corresponding Val(6>) can be bounded by 

k\ k\ 

n=0 

since the product is convergent. In the case in which / is a trigonometric polynomial of 

(k) 

degree iV the above restricted contributions to u • would generate a convergent series 
for e small enough. In fact the number of trees is bounded, as in Sec. 12, by k\4 k (2N + l) ek 
so that the series \e\ k |u-h£, fe ^| would converge for s small (i.e. \s\ < (B -4(2N + 1)^) _1 ). 

Given this comment the analysis of the "remaining contributions" becomes the real 
problem and it requires new ideas because among the excluded trees there are some simple 
k-th order trees whose value alone, if considered separately from the other contributions, 
would generate a factor ially divergent power series in e. 
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However the contributions of all large valued trees of order k can be shown to cancel: 
although not exactly (unlike the case of the elementary problem of Sec. 12, where the 
cancellation is not necessary for the proof, in spite of its exact occurrence), but enough so 
that in spite of the existence of exceedingly large values of individual tree graphs their total 
sum can still be bounded by a constant to the power k so that the power series actually 
converges for e small enough. The idea is discussed in Appendix Al. 

References: [Po], [Ko55], [Mo62], [Ar68]. 

15. Resonances and their stability 

A quasi periodic motion with r rationally independent frequencies is called resonant 
if r is strictly less than the number £ of degrees of freedom. The difference s = £ — r is the 
degree of the resonance. 

Of particular interest are the cases of a perturbation of an integrable system in which 
resonant motions take place. 

A typical example is the n-body problem which studies the mutual perturbations of 
the motions of n — 1 particles gravitating around a more massive particle. If the particles 
masses can be considered negligible the system will consist of n — 1 central Keplerian 
motions: it will therefore have £ = 3(n — 1) degrees of freedom. In general, only one 
frequency per body occurs in absence of the perturbations (the period of the Keplerian 
orbit): hence r < n — 1 and s > 2(n — 1) (or in the planar case s > (n — 1)) with equality 
holding when the periods are rationally independent. 

Another example is the rigid body with a fixed point perturbed by a conservative 
force: in this case the unperturbed system has 3 degrees of freedom but, in general, only 
two frequencies, see Section 9 after (9.10). 

Furthermore in the above examples there is the possibility that the independent fre- 
quencies assume, for special initial data, values which are rationally related, giving rise to 
resonances of even higher order (i.e. with smaller values of r). 

In an integrable anisochronous system resonant motions will be dense in phase space 
because the frequencies w(A) will vary as much as the actions and therefore resonances of 
any order (i.e. any r < £) will be dense in phase space: in particular the periodic motions 
(i.e. the highest order resonances) will be dense. 

Resonances, in integrable systems, can arise in a priori stable integrable systems and 
in a priori unstable systems: the first are systems whose Hamiltonian admits canonical 
action-angle coordinates (A, ex) G U x T l with U C M 1 open, while the second are systems 
whose Hamiltonian has, in suitable local canonical coordinates, the form 

Wo(A) + jr \(p\ - A?g?) + |(tt? + ^ 2 ), Ai? ^ > o (15.1) 

i=l j=l 

where (A, ex.) G U xT, U e M r , (p, q) G V C R 2s \ (tt, k) G V C M 2 * 2 with V, V 
neighborhoods of the origin and £ = r + s\ + S2, Sj > 0, s\ + S2 > and i^/juj are 
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called Lyapunov coefficients of the resonance. The perturbations considered are supposed 
to have the form ef(A, ct, p, q, 7r, k). The denomination of a priori stable or unstable 
refers to the properties of the "a priori given unperturbed Hamiltonian" . The name of a 
priori unstable is certainly appropriate if s\ > 0: here also s\ = is allowed for notational 
convenience implying that the Lyapunov coefficients in a priori unstable cases are all of 
order 1 (whether real, Xj or imaginary i^JJxf). In other words the a priori stable case, 




s\ = $2 = in (15.1), is the only excluded case. Of course the stability properties of the 
motions when a perturbation acts will depend on the perturbation in both cases. 

The a priori stable systems have usually a great variety of resonances {e.g. in the 
anisochronous case resonances of any dimension are dense). The a priori unstable systems 
have (among possible other resonances) some very special r-dimensional resonances occur- 
ring when the unstable coordinates (p, q) and (7r, k) are zero and the frequencies of the r 
action-angle coordinates are rationally independent. 

In the first case, a priori stable, the general question is whether the resonant motions, 
which form invariant tori of dimension r arranged into families that fill I dimensional in- 
variant tori, continue to exists, in presence of small enough perturbations e/(A, ck), on 
slightly deformed invariant tori. Similar questions can be asked in the a priori unsta- 
ble cases. To examine more closely the matter consider the formulation of the simplest 
problems. 

A priori stable resonances: More precisely let {Ao} x T £ be the unperturbed invariant 
torus 7a with spectrum u>q = w(Aq) = <9a7^o(Ao) with only r rationally independent 

components. For simplicity suppose that u> = (oji, . . . , uj r , 0, . . . , 0) d = (a;, 0) with u> G Mf . 
The more general case in which u> has only r rationally independent components can be 
reduced to special case above by a canonical linear change of coordinates at the price of 
changing the Hq to a new one, still quadratic in the actions but containing mixed products 
AiBji the proofs of the results that are discussed here would not be really affected by such 
more general form of 7i. 

It is convenient to distinguish between the "fast" angles ai, . . . , a r and the "resonant" 
angles a r +i, . . . , ag (also called "slow" or "secular") and call a = (a' , (3) with ex.' e T r 
and /3 G T s . Likewise we distinguish the fast actions A' = (A±, . . . , A r ) and the resonant 
ones A r+1 , . . . , A t and set A = (A', B) with A' G W and B G R s . 

Therefore the torus 7a , A = (Aq,B ), is in turn a continuum of invariant tori 
7a ,/3 with trivial parametric equations: (3 fixed, a! = if), if) G T r , and A' = Aq, B = Bo. 
On each of them the motion is: A',B,/3 constant and ct' — > a' + ujt, with rationally 
independent u> G Mf . 

Then the natural question is whether there exist functions h e ,k e ,H e ,K e smooth in 
e near e = and in V £ T r , vanishing for e = 0, and such that the torus TA ,/3 , e with 
parametric equations 




A' =A' + H e (VO, 
B=B + K e (V), 



a! = V + h e (i/0, 
(3 = A) + k e (V), 



■0 G T r 



(15.2) 



is invariant for the motions with Hamiltonian 7Y e (A, at) 



\K' 2 + iB 2 +£/(«', (3) and 
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the motions on it are ift — > ij) + u>t. The above property, when satisfied, is summarized by 
saying that the unperturbed resonant motions A = (Aq,Bq), cx = (ct' + u)'t, (3q) can be 
continued in presence of perturbation ef, for small e, to quasi periodic motions with the 



A priori unstable resonances: here the question is whether the special invariant tori con- 
tinue to exist in presence of small enough perturbations, of course slightly deformed. This 
means asking whether, given Aq such that lj(Aq) = <9a7~^o(Ao) has rationally independent 
components, there are functions (H e (?/>), h e (t/>)), (P e (?/>), Q e (V0) an d (^('/ , ),K £ (i/')) 
smooth in e near e = 0, vanishing for e = 0, analytic in if) e T r and such that the 
r-dimensional surface 



is an invariant torus Ta , £ on which the motion is if) — > i/> + u>(Ao)£. Again the above 
property is summarized by saying that the unperturbed special resonant motions can be 
continued in presence of perturbation ef for small e to quasi periodic motions with the 
same spectrum and on a slightly deformed torus 7a 0iS . 

Some answers to the above questions are presented in the following section. 

References: [GBG04]. 

16. Resonances and Lindstedt series 

We discuss the equations (15.2) in the paradigmatic case in which the Hamiltonian 
H (A) is ±A 2 (cf. (14.1)). It will be w(A') = A' so that A = w, B = and the 
perturbation /(a) can be considered as a function of a = (a',/3): let f((3) be defined as 
its average over ex' . The determination of the invariant torus of dimension r which can be 
continued in the sense discussed in Sect. 15 is easily understood in this case. 

A resonant invariant torus which, among the tori Ta ,p, has parameric equations that 
can be continued as a formal powers series in e is the torus Ta o ,0o with Ad a stationarity 
point for f(/3), i.e. an equilibrium point for the average perturbation: dpf(/3o) = 0. In fact 
the following theorem holds 

If u) G Mf satisfies a Diophantine property and if (3q is a nondegenerate stationarity point 
for the "fast angle average" f((3) (i.e. such that detd^ /3 f(f3o) ^ 0), then the equations for 
the functions h e , k e ; 



same spectrum and on a slightly deformed torus 7a' p , e - 



A=A + H e (VO, 

P=P e (V>), 
7z=n £ (if)), 



oc = if) + h e (if)) 

q = Q e (V>) 

K = K £ (lf)) 



if)eT 



(15.3) 




ed a ,f(if) + h e (if)),f3 + k e (if))) 
edf3f(if) + h £ (ij)) + -k £ (ij))) 



(16.1) 
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Given the simplicity of the Hamiltonian that we are considering, i.e. (14.1), it is not 
necessary to discuss the functions H e ,K e because the equations that they should obey 
reduce to their definitions as in the case of Sec. 14, and for the same reason. 

In other words also the resonant tori admit a Lindstedt series representation. It is 
however very unlikely that the series are, in general, convergent. 

Physically this new aspect is due to the fact that the linearization of the motion near 
the torus Za. ,/3 introduces oscillatory motions around Tpj p with frequencies propor- 
tional to the square roots of the positive eigenvalues of the matrix edppf((3o): therefore 
it is naively expected that it has to be necessary that a Diophantine property be required 
on the vector (u>, y/sjli, ■ ■ ■) where Efij are the positive eigenvalues. Hence some values of 
e, those for which (a>, y/efii, . . .) is not a Diophantine vector or is too close to a non Dio- 
phantine vector, should be excluded or at least should be expected to generate difficulties. 
Note that the problem arises no matter what is supposed on the non degenerate matrix 
dppfiflo) since e can have either sign; and no matter how |e| is supposed small. But we can 
expect that if the matrix dppf^o) is (say) positive definite (i.e. (3q is a minimum point 
for /(/3)) then the problem should be easier for e < and viceversa if flo is a maximum 
it should be easier for e > (i.e. in the cases in which the eigenvalues of edppf((3o) are 
negative and their roots do not have the interpretation of frequencies). 

Technically the sums of the formal series can be given (so far) a meaning only via 
summation rules involving divergent series: typically one has to identify in the formal 
expressions (denumerably many) geometric series which although divergent can be given 
a meaning by applying the rule (13.7). Since the rule can only be applied if z ^ 1 this 
leads to conditions on the parameter e, in order to exclude that the various z that have to 
be considered are too close to 1. Hence this stability result turns out to be rather different 
from the KAM result for the maximal tori. Namely the series can be given a meaning 
via summation rules provided / and (3q satisfy certain additional conditions and provided 
certain values of e are excluded. An example of a theorem is the following: 

Given the Hamiltonian (14-1) o,nd a resonant torus T^ 1 p with u> = A' e Mf satisfying a 
Diophantine property let (3q be a non degenerate maximum point for the average potential 

J((3) = (2ir)~ r f r f(a',/3)d r a'. Consider the Lindstedt series solution for the equations 
(16.1) of the perturbed resonant torus with spectrum (u>,0). It is possible to express the 
single n-th order term of the series as a sum of many terms and then rearrange the series 
thus obtained so that the resummed series converges for e in a domain £ which contains a 
segment [0, £o] an d also contains a subset of [— Eq,0] which, although with open dense com- 
plement, is so large that it has as a Lebesgue density point. Furthermore the resummed 
series for h e , k e define an invariant r dimensional analytic torus with spectrum u>. 

More generally if (3q is only a nondegenerate stationarity point for f(f3) the domain 
of definition of the resummed series is a set S C [— £o,£o] which on both sides of the origin 
has an open dense complement although it has as a Lebesgue density point. 

The above theorem can be naturally extended to the general case in which the Hamil- 
tonian is the most general perturbation of an anisochronous integrable system H £ (A, at) = 
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h(A) +ef(A, a) if d AA h is a non singular matrix and the resonance arises from a spectrum 
u}(Aq) which has r independent components (while the remaining are not necessarily 0). 

We see that the convergence is a delicate problem for the Lindstedt series for nearly 
integrable resonant motions. They might even be divergent (mathematically a proof of 
divergence is an open problem but it is a very resonable conjecture in view of the above 
physical interpretation) , nevertheless the above theorem shows that sum rules can be given 
that "sometimes", i.e. for e in a large set near e = 0, yield a true solution to the problem. 

This is reminiscent of the phenomenon met in discussing perturbations of isochronous 
systems in (13.4), but it is a much more complex situation. And it leaves many open 
problems: foremost of them is the question of uniqueness. The sum rules of divergent 
series always contain some arbitrary choices: which lead to doubt about the uniqueness of 
the functions parameterizing the invariant tori constructed in this way. It might even be 
that the convergence set S may depend upon the arbitrary choices, and that considering 
several of them no e with \e\ < Sq is left out. 

The case of a priori unstable systems has also been widely studied: in this case 
too resonances with Diophantine r-dimensional spectrum u> are considered. However in 
the case S2 = (called a priori unstable hyperbolic resonance) the Lindstedt series can 
be shown to be convergent while in the case s± = (called a priori unstable elliptic 
resonance) or in the mixed cases si,S2 > extra conditions are needed. They involve u> 
and n = ([Mi, . . . , [i S2 ), °f- (15-1), and properties of the perturbations as well. It is also 
possible to study a slightly different problem: namely to look for conditions on u>,[x,f 
which imply that for small e invariant tori with spectrum e-dependent but close, in a 
suitable sense, to u) exist. 

The literature is vast but it seems fair to say that, given the above comments, particu- 
larly those concerning uniqueness and analyticity, the situation is still quite unsatisfactory. 

References: [GBG04]. 

17. Diffusion in phase space 

The KAM theorem implies that a perturbation of an analytic anisochronous inte- 
grable system, i.e. with an analytic Hamiltonian H e (A, a) = Ho(A) +ef(A,a.) and non 
degenerate Hessian matrix d AA h(A), generates large families of maximal invariant tori. 
Such tori lie on the energy surfaces but do not have codimension 1 on them, i.e. do not 
split the (2£ — l)-dimensional energy surfaces into disconnected regions except, of course, 
in the case of 2-degrees of freedom systems, see Sect. 14.. 

Therefore there might exist trajectories with initial data close in action space to A 1 
which reach phase space points close in action space to A-^ ^ A 1 for e ^ 0, no matter how 
small. Such diffusion phenomenon would occurr in spite of the fact that the corresponding 
trajectory has to move in a space in which very close to each {A} x T there is an invariant 
surface on which points move keeping A constant within O(e), which for e small can be 
< |A' - A l |. 

In a priori unstable systems (cf. Sect. 15) with si = 1, S2 = it is not difficult to 
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see that the corresponding phenomenon can actually happen: the paradigmatic example 
(Arnold) is the a priori unstable system 

A 2 p 2 

H e = — + A 2 + — + g (cos q — 1) + e (cos a\ + sma 2 )(cosq — 1) (17.1) 

— ■ 

This is a system describing a motion of a "pendulum" ((p, q) coordinates) interacting with 
a "rotating wheel" ((Ai,a\) coordinates) and a "clock" ((^2,0:2) coordinates) a priori 
unstable near the points p = 0, q = 0, 2n, (si = 1, s 2 = 0, Ai = yfg, cf. (15.1)). And it 
can be proved that on the energy surface of energy E and for each e 7^ small enough 
(no matter how small) there are initial data with action coordinates close to A 1 = (A\, A 2 ) 
with \A\ 2 + A\ close to E eventually evolving to a datum A' = (A[, A' 2 ) with A\ at 

distance from A{ smaller than an arbitrarily prefixed distance (of course with energy E) . 
Furthermore during the whole process the pendulum energy stays close to within o(e) 
(i.e. the pendulum swings following closely the unperturbed separatrices) . 

In other words (17.1) describes a machine (the pendulum) which, working approxi- 
mately in a cycle, extracts energy from a reservoir (the clock) to transfer it to a mechanical 
device (the wheel). The statement that diffusion is possible means that the machine can 
work as soon as e 7^ 0, if the initial actions and the initial phases (i.e. oti,a2,p,q) are 
suitably tuned (as functions of e) . 

The peculiarity of the system (17.1) is that the unperturbed pendulum fixed points 
P± (i.e. the equilibria p = 0, q = 0, 2tt) remain unstable equilibria even when e 7^ 0: and 
this is an important simplifying feature. 




Fig. 6: The first drawing represents the e = geometry: the "partial energy" lines are parabolae, 

\a\ + A2 = const. The vertical lines are the resonances A\ = rational {i.e. v\A\ + V2 = 0). The 

disks are neighborhoods of the points A 1 and A-f (the dots at their centers). The second drawing (e 7^ 0) 

is an artist rendering of a trajectory in A space, driven by the pendulum swings to accelerate the wheel 

from A 1 ^ to A^ at the expenses of the clock energy, sneaking through invariant tori (not represented and 

approximately) located "away" from the intersections between resonances and partial energy lines (a dense 

set, however). The pendulum coordinates are not shown: its energy stays close to within a power of s. 

Hence the pendulum swings staying close to the separatrix. The oscillations symbolize the wiggly behavior 

of the partial energy + A2 in the process of sneaking between invariant tori which, because of their 

invariance, would be impossible without the pendulum. The energy ^A^ of the wheel increases slightly at 

each pendulum swing: accurate estimates yield an increase of the wheel speed A\ of the order of e/loge -1 

— - — 1 —1 

at each swing of the pendulum implying a transition time of the order of g 2 e 1 log s . 

It permits bypassing the obstacle, arising in the analysis of more general cases, rep- 
resented by the resonance surfaces consisting in the A's with A\V\ + is 2 = 0: the latter 
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correspond to harmonics (^1,^2) present in the perturbing function, i.e. the harmonics 
which would lead to division by zero in an attempt to construct (as necessary in the proof 
by Arnold's method) the parametric equations of the perturbed invariant tori with action 
close such A's. In the case of (17.1) the problem arises only on the resonance marked in 
Fig. 6 by a heavy line i.e. A\ = corresponding to coscti in (17.1). 

If e = the points P- with p = 0, q = and the point P + with p = 0, q = 2n 
are both unstable equilibria (and they are of course the same point, if q is an angular 
variable). The unstable manifold (it is a curve) of P+ coincides with the stable manifold 
of P- and viceversa. So that the unperturbed system admits non trivial motions leading 
from P+ to P- and from P_ to P+ , both in a biinfinite time interval (—00, 00): the p, q 
variables describe a pendulum and P± are its unstable equilibria which are connected by 
the separatrices (which constitute the 0-energy surfaces for the pendulum). 

The latter property remains true for more general a priori unstable Hamiltonians 

H e = H (A) + H u (p, q) +ef(A, a, p, q), in (U x T £ ) x (M 2 ) (17.2) 

where 7i u is a one dimensional Hamiltonian which has two unstable equilibrium points 
P+ and P- linearly repulsive in one direction and linearly attractive in another which are 
connected by two heteroclinic trajectories which, as time tends to ±00, approach P_ and 
P + and viceversa. 

Actually the points need not be different but, if coinciding, the trajectories linking 
them must be nontrivial: in the case (17.1) the variable q can be considered an angle 
and then P + and _P_ would coincide (but are connected by nontrivial trajectories, i.e. by 
trajectories that visit points different from P±). Such trajectories are called heteroclinic if 
P+ 7^ P- and homoclinc if P + = P- . 

In the general case besides the homoclinicity (or heteroclinicity) condition certain weak 
genericity conditions, automatically satisfied in the example (17.1), have to be imposed in 
order to show that given A* and A-^ with the same unperturbed energy E one can find, for 
all e small enough but not equal to 0, initial data (e-dependent) with actions arbitrarily 
close to A* which evolve to data with actions arbitrarily close to A A This is a phenomenon 
called Arnold diffusion. Simple sufficient conditions for a transition from near A* to near 

are expressed by the following result 

Given the Hamiltonian (17.2) with H u admitting two hyperbolic fixed points P± with het- 
eroclinic connections, t — > (p a (t), q a (t)), a = 1,2, suppose that 

(1) On the unperturbed energy surface of energy E = Tt(A l ) +TC U (P±) there is a regular 
curve 7 : s — > A(s) joining A 1 to A-* such that the unperturbed tori {A(s)} x T can be 
continued at e 7^ into invariant tori Tx(s),e f or a se t °f values of s which fills the curve 
7 leaving only gaps of size of order o(e). 

(2) The £ x £ matrix Dij of the second derivatives of the integral of f over the heteroclinic 
motions is not degenerate, i.e. 



detD\ = \det(f dtd aiaj f(A,a + u(A)t,p a (t),q a (t)))\>c>0 (17.3) 
^ J — 00 
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for all A 's on the curve 7 and all a e T . 

Given arbitrarily p > 0, for £ ^ small enough there are initial data with action and 
energy closer than p to A 1 and E, respectively, which after a long enough time acquire an 
action closer than p to A? (keeping the initial energy). 

The above two conditions can be shown to hold generically for many pairs A* ^ 
(and many choices of the curves 7 connecting them) if the number of degree of freedom is 
> 3. Thus the result, obtained by a simple extension of the argument originally outlined 
by Arnold to discuss the paradigmatic example (17.1), proves the existence of diffusion in 
a priori unstable system. The integral in (17.3) is called Melnikov integral. 

The real difficulty is to estimate the time needed for the transition: it is a time that 
obviously has to diverge as e — > 0. Assuming g fixed (i.e. ^-independent) a naive approach 
easily leads to estimates which can even be worse than 0(e ae b ) with some a, b > 0. It 
has finally been shown that in such cases the minimum time can be, for rather general 
perturbations ef(ct, q), estimated above by 0(e~ 1 loge -1 ), which is the best that can be 
hoped for under generic assumptions. 

References: [Ar68], [CV00]. 

18. Long time stability of quasi periodic motions 

A more difficult problem is whether the same phenomenon of migration in action 
space occurs in a priori stable systems. The root of the difficulty is a remarkable stability 
property of quasi periodic motions. Consider Hamiltonians 7Y e (A, a) = h(A) + ef(A, a) 
with Hq(A) = h(A) strictly convex, analytic and anisochronous on the closure U of an 
open bounded region U C M 1 , and a perturbation ef(A, a) analytic in U x T^. 

Then a priori bounds are available on how long it can possibly take to migrate from 
an action close to Ai to one close to A2: and the bound is of "exponential type" as e — > 
(i.e. it admits a lower bound which behaves as the exponential of an inverse power of e). 
The simplest theorem is (NEKHOROSSEV): 

There are constants < a, 6, d, g, r such that any initial datum (A, a) evolves so that the 
A will not change by more than as 9 before a long time bounded below by re bs 

Thus this puts an exponential bound, i.e. a bound exponential in an inverse power 
of e, to the diffusion time: before a time re be actions can only change by 0(e 9 ) so that 
their variation cannot be large no matter how small e 7^ is chosen. This places a (long) 
lower bound to the time of diffusion in a priori stable systems. 

The proof of the theorem provides, actually, an interesting and detailed picture of the 
actions variations showing that some actions ay vary slower than others. 

The theorem is constructive, i.e. all constants < a,b,d,T can be explicitly chosen 
and depend on £,TCo, f although some of them can be fixed to depend only on t and on 
the minimum curvature of the convex graph of Hq. Its proof can be adapted to cover 
many cases which do not fall in the class of systems with strictly convex unperturbed 
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Hamiltonian. And even to cases with a resonant unperturbed Hamiltonian. 

However in important problems, e.g. in the 3 body problems met in Celestial Me- 
chanics, there is empirical evidence that diffusion takes place at a fast pace (i.e. not expo- 
nential slow in the above sense) while the above results would forbid a rapid migration 
in phase space if they applied: however in such problems the assumptions of the theorem 
are not satisfied, because the unperturbed system is strongly resonant (as in the celes- 
tial mechanics problems where the number of independent frequencies is a fraction of the 
number of degrees of freedom and h(A) is far from strictly convex), leaving wide open the 
possibility of observing rapid diffusion. 

And changing the assumptions can dramatically change the results. For instance rapid 
diffusion can sometimes be proved even though it might be feared that it should require 
exponentially long times: an example that has been proposed is the case of a three time 
scales system, with Hamiltonian 

2 

P 

ujiAx + uj 2 A 2 + — + g(l + cosq) + ef(a u a 2 ,p, q) (18-1) 

with u> e d = (ui,u>2) with uji = e~^oJ, 002 = e^uj and JU, u > constants. The three scales 
are u^ 1 , \fg~^ .uj^ 1 ■ In this case there are many (although by no means all) pairs Ai, A2 
which can be connected within a time that can be estimated to be of order 0(s~ 1 loge -1 ). 

This is a rapid diffusion case in a a priori unstable system in which condition (17.3) is 
not satisfied: because the £-dependence of u>(A) implies that the lower bound c in (17.3) 
must depend on e (and be exponentiallly small with an inverse power of e as e — > 0). 

The unperturbed system in (18.1) is non resonant in the part Ho for e > outside 
a set of zero measure (i.e. where the vector u> £ satisfies a suitable Diophantine property) 
and, furthermore, it is a priori unstable: cases met in applications can be a priori stable 
and resonant (and often not anisochronous) in the part TCq. And in such system not only 
the speed of diffusion is not understood but proposals to prove its existence, if present (as 
expected), have so far not given really satisfactory results. 

References: [Ne77]. 

19. The three bodies problem 

Mechanics and the three bodies problem can be almost identified in the sense that 
the motion of three gravitating masses has been since the beginning a key astronomical 
problem and at the same time the source of inspiration for many techniques: foremost 
among them the theory of perturbations. 

As an introduction consider a special case. Let three masses ms = mo,mj = 
mijffiM = 1TI2 interact via gravity, i.e. with interaction potential — kmim^i — Xj| _1 : 
the simplest problem arises when the third body has a neglegible mass compared to the 
two others and the latter are supposed to be on a circular orbit; furthermore the mass mj 
is ems with e small and the mass tum moves in the plane of the circular orbit. This will 
be called the circular restricted three body problem. 
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In a reference system with center S and rotating at the angular speed of J around S 
inertial forces (centrifugal and Coriolis') act. Supposing that the body J is located on the 
axis with unit vector i at distance R from the origin S, the acceleration of the point M is 
q = F + uJq(q — j^i) — 2u>o A q if F is the force of attraction and ujq /\ q = ujqQ 1 - where 

ujq is a vector with |wo| = ujq and perpendicular to the orbital plane and q l d = (~P2, Pi) 
if q = (pi, p^). Here, taking into account that the origin £ rotates around the fixed center 
of mass, uJq(q — j^i) is the centrifugal force while — 2u> A g is the Coriolis force. The 
equations of motion can therefore be derived from a Lagrangian 

£ = \'Q 2 ~ W + u Q ± ■ Q + \ujIq 2 - ool^-Q ■ i (19.1) 

where u)qR 3 = kms(l+s) d = go and W = —^f~ ~ \^m\ ^ ^ ^ s tne gravitational constant, 
R the distance between S and J and finally the last three terms come from the Coriolis 
force (the first) and from the centripetal force (the other two, taking into account that the 
origin S rotates around the fixed center of mass). 

Setting g = g / (1 + e) = kms, the Hamiltonian of the system is 

n = Up - - ft - \^Iq 2 - - r 1 - I • i) (19.2) 

The first part can be expressed immediately in the action-angle coordinates for the 
two body problem, cf. Sect. 8. Calling such coordinates (Lo, Ao, Go, 7o) and 9q the polar 
angle of M with respect to the ellipse major axis and Ao the mean anomaly of M on its 
ellipse, the Hamiltonian becomes, taking into account that for e = the ellipse axis rotates 
at speed — cuq, 



H = -fa - u Go - 6$ (If - il" 1 " 1 • i) (19-3) 



e={l-§)K \Mi - Go. L 



which is convenient if we study the interior problem, i.e. \g\ < R. This can be expressed 
in the action-angle coordinates via (8.1), (8.2): 

#0 = A + fx Q , 6> + 70 = A + 70 + f\ , 

(19.4) 

R ~ gR l+ecos(A +/^)' 

where, see (8.2), f\ = (f(e sin A, e cos A) and f(x, y) = 2x(l + ^y + . . .) with the . . . denoting 
higher orders in x,y even in x. The Hamiltonian takes the form, if u 2 = gR~ s , 

2 

n * = ~^T2 ~ " G o + e% F (Go, L , A , A + 70) (19.5) 

^.Lq Jri 

where the only important feature (for our purposes) is that F(L,G,a, j3) is an analytic 
function of L, G, a, (3 near a datum with |G| < L (i.e. e > 0) and \g\ < R. However the 
domain of analyticity in G is rather small as it is constrained by \G\ < L excluding in 
particular the circular orbit case G = ±L. 

Note that apparently the KAM theorem fails to be applicable to (19.5) because the 
matrix of the second derivatives of Hq(L,G) has determinant. Nevertheless the proof 
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of the theorem goes through also in this case, with minor changes. This can be checked 
by studying the proof or, following a remark by PoiNCARE, by simply remarking that the 

"squared" Hamiltonian H' e d = (H e ) 2 has the form 

K = (-7^2 - w G of + eF'(G , L , A , A + 70) (19.6) 
with F' still analytic. But this time det g^Jzlo) = —§9 2 Fq~^ ui^h 7^ if h = —g 2 Lq 2 — 

2uG Q / 0. 

Therefore the KAM theorem applies to H' s and the key observation is that the orbits 
generated by the Hamiltonian (H e ) 2 are geometrically the same as those generated by the 
Hamiltonian 7i e : they are only run at a different speed because of the need of a time 
rescaling by the constant factor 27i e . 

This shows that, given an unperturbed ellipse of parameters (L ,Go) such that u> = 

2 

(£3-, —a;), Go > 0, with U1/U2 Diophantine, then the perturbed system admits a motion 
which is quasi periodic with spectrum proportional to u> and takes place on an orbit which 
wraps around a torus remaining forever close to the unperturbed torus (which can be 
visualized as described by a point moving, according to the area law on an ellipse rotating 
at rate — ujq) with actions (L , Go), provided e is small enough. Hence 

The KAM theorem answers, at least conceptually, the classical question: can a solution of 
the three body problem remain close to an unperturbed one forever? i.e. is it possible that 
a solar system is stable forever? 

Assuming e, \ q\/R <C 1 and retaining only the lowest orders in e and \q\/R <C 1 the 
Hamiltonian (19.5) simplifies into 

n = -& o -uG + *e(Go) - H^(3cos2(A +70)- 

- e cos A - f e cos(A + 270) + |e cos(3A + 270)) 

where 5 £ (G ) = -((1 + e)* - 1) uG - and e = (1 - Gg/Lg)J. 

It is an interesting exercise to estimate, assuming as model the (19.7) and following 
the proof of the KAM theorem, how small has e to be if a planet with the data of Mercury 
can be stable forever on a (slowly precessing) orbit with actions close to the present day 
values under the influence of a mass e times the solar mass orbiting on a circle, at a distance 
from the Sun equal to that of Jupiter. It is possible to follow either the above reduction to 
the ordinary KAM theorem or to apply directly to (19.7) the Lindstedt series expansion, 
proceeding along the lines of Sect. 14. The first approach is easy but the second is more 
efficient: unless the estimates are done in a particularly careful manner the value found 
for ems does not have astronomical interest. 

References: [Ar68]. 
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20. Rationalization and regularization of singularities 



Often integrable systems have interesting data which lie on the boundary of the in- 
tegrability domain. For instance the central motion when L = G (circular orbits) or the 
rigid body in a rotation around one of the principal axes or the two body problem when 
G = (collisional data). In such cases perturbation theory cannot be applied as discussed 
above. Typically the perturbation depends on quantities like y/L — G and is not analytic 
at L = G. Nevertheless it is sometimes possible to enlarge phase space and introduce new 
coordinates in the vicinity of the data which in the initial phase space are singular. 

A notable example is the failure of the analysis of the circular restricted three body 
problem: it apparently fails when the orbit that we want to perturb is circular. 

It is convenient to introduce the canonical coordinates L, A and G, 7 

L = L , G = L -G , A = A + 70, 7 = ~7o (20.1) 

so that e = v / 2GL~ 1 a/1 — G(2L)~ 1 and Ao = A + 7 and 6>o = Ao + /a where /a is defined 
in (8.2) (see also (19.4)). Hence 

Oo = A + 7 + /a+ 7 , 60 + 70 = A + / A+7 , 



e =v / 2G,/ifl - G_) lei _ L 2 (i-e 2 ) 1 

e v^y L ^J- 2L)-> R " gR 1+e cos(A+ 7 +/ A+7 ) ' 

and the Hamiltonian (19.7) takes the form 



(20.2) 



2 

Hs = ~2L2 " ujL + uG + £ R F ^ L - G ' L ' A + 7 ' A) (20 - 3) 

In the coordinates L, G of (20.1) the unperturbed circular case corresponds to G = and 
the (19.3) once expressed in the action-angle variables G, £,7, A is analytic in a domain 
whose size is controlled by \J~G~. Nevertheless very often problems of perturbation theory 
can be "regularized'' . 

This is done by "enlarging the integrability" domain by adding to it points (one or 
more) around the singularity (a boundary point of the domain of the coordinates) and 
introduce new coordinates to describe simultaneously the data close to the singularity and 
the newly added points: in many interesting cases the equations of motion are no longer 
singular, i.e. become analytic, in the new coordinates and therefore apt to describe the 
motions that reach the singularity in a finite time. One can say that the singularity was 
only apparent. 

Perhaps this is best illustrated precisely in the above circular restricted three body 
problem. There the singularity is where G = 0, i.e. at a circular unperturbed orbit. If we 
describe the points with G small in a new system of coordinates obtained from the one in 
(20.1) by letting alone L, A and setting 

p = V2G cos 7, q = V2G sin 7 (20.4) 

then p, q vary in a neighborhood of the origin with the origin itself excluded. 

Adding the origin of the p, q plane then in a full neighborhood of the origin the 
Hamiltonian (19.3) is analytic in L, A,p, q. This is because it is analytic, cf. (19. 3), (19. 4), 
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as a function of L, A and e cos 6>o and of cos(Ao+6>o). Since 9q = A+7+/A+7 and 6>o+Ao = A+ 
/a+7 by (19.4), the Hamiltonian (19.3) is analytic in L, A, e cos(A + 7 + /a+7) 5 cos(A + /a+ 7 ) 
for e small (i.e. for G small) and, by (8.2), /a+ 7 is analytic in esin(A + 7) and ecos(A + 7). 
Hence the trigonometric identities 

e sin(A + 7 ) = r™^"»* e cos(A + 7) = E^piLA (20 .5) 

together with G = \{p 2 + q 2 ) imply that (20.3) is analytic near p = q = and L > 
0, A G [0,2-71"]. The Hamiltonian becomes analytic and the new coordinates are suitable to 

def 1 2,2 ! 

describe motions crossing the origin: e.g. setting C = 4(1 — p 7^ ) L~? (19.7) becomes 



n = - & - uL + u\{p 2 + q 2 ) + 8 £ {\{p 2 + q 2 )) - H ^-^W)) 4 . 
• (3 cos 2A - ((-11 cos A + 3 cos 3A) p - (7 sin A + 3 sin 3A) q) Cj 



(20.6) 



The KAM theorem does not apply in the form discussed above to "cartesian coor- 
dinates" i.e. when, as in (20.6), the unperturbed system is not assigned in action-angle 
variables: however there are versions of the theorem (actually corollaries of it) which do 
apply and therefore it becomes possible to obtain some results even for the perturbations 
of circular motions by the techniques that have been illustrated here. 

Likewise the Hamiltonian of the rigid body with a fixed point O and subject to analytic 
external forces becomes singular, if expressed in the action-angle coordinates of Deprit, 
when the body motion nears a rotation around a principal axis or more generally nears 
a configuration in which any two of the axes i3, z, zq coincide (i.e. any two among the 
principal axis, the angular momentum axis and the inertial z-axis coincide, see Section 9) . 
Nevertheless by imitating the procedure just described in the simpler cases of the circular 
three body problem, it is possible to enlarge phase space so that in the new coordinates 
the Hamiltonian is analytic near the singular configurations. 

A regularization also arises when considering collisional orbits in the unrestricted 
planar three body problem. In this respect a very remarkable result is the regularization 
of collisional orbits in the planar three body problem. After proving that if the total 
angular momentum does not vanish simoultaneous collisions of the three masses cannot 
happen within any finite time interval the question is reduced to the regularization of two 
bodies collisions, under the assumption that the total angular momentum does not vanish. 

The local change of coordinates which changes the relative position coordinates (x, y) 
of two colliding bodies as (x, y) — > (£, 77) with x + iy = (£ + it]) 2 is not one to one, hence it 
has to be regarded as an enlargement of the positions space, if points with different (£, rj) 
are considered different. However the equations of motion written in the variables £, t] have 
no singularity at £, 77 = 0, ( Levi- Ci vita). 

Another celebrated regularization is the regularization of the Schwartzschild metric, 
i.e. of the general relativity version of the two body problem: it is however somewhat out 
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of the scope of this review (Synge, Kruskal). 
References: [LC]. 

Appendix Al. KAM resummation scheme 



The idea to control the "remaining contributions" is to reduce the problem to the case 
in which there are no pairs of lines that follow each other in the tree order and which have 
the same current. Mark by a scale label "0" the lines of a tree whose divisors are > 1: these 
are lines which give no problems in the estimates. Then mark by a scale label "> 1" the 
lines with current v(l) such that \ui ■ u(l)\ < 2~ n+1 for n = 1 (i.e. the remaining lines). 

The lines labeled are said to be on scale 0, while those labeled > 1 are said to be 
on scale > 1. A cluster of scale will be a maximal collection of lines of scale forming a 
connected subgraph of a tree 9. 

Consider only trees 9q G ©o of the family Go of trees containing no clusters of lines 
with scale label which have only one line entering the cluster and one exiting it with equal 
current. 

It is useful to introduce the notion of a line £\ situated "between" two lines £,£' with 
£' > £: this will mean that £\ precedes £' but not £. 

All trees 6 in which there are some pairs /' > / of consecutive lines of scale label 
> 1 which have equal current and such that all lines between them bear scale label are 
obtained by "inserting" on the lines of trees in 6 with label > 1 any number of clusters of 
lines and nodes, with lines of scale and with the property that the sum of the harmonics 
of the nodes inserted vanishes. 

Consider a line Iq G 9q £ ©o linking nodes v± < V2 and labeled > 1 and imagine 
inserting on it a cluster 7 of lines of scale with sum of the node harmonics vanishing and 
out of which emerges one line connecting a node v ou t in 7 to V2 and into which enters one 
line linking v\ to a node Vi n G 7. The insertion of a /c-lines, I7I = (k + l)-nodes, cluster 
changes the tree value by replacing the line factor, that will be briefly called "value of the 
cluster 7, 

v vi ■ »V2 _^ (u Vl -M(r,^{l ))u V2 ) 1 
uj -u(1 ) 2 u -v(l ) 2 UJq-v{I q ) 2 

where Mis a £x£ matrix M ra (7, i/(Z )) = ^^out.r^W Y[ vei {-fu v ) Y[ iei J^fc if^ = v'v 
denotes a line linking v' and v. Therefore if all possible connected clusters are inserted 
and the resulting values are added up the result can be taken into account by attributing 

to the orie; inal line Z a factor like (Al.l) with M^°\iy(l )) d = Y. 1 M i.l^{h)) replacing 
M( 7 ;i/(/ )). 

If several connected clusters 7 are inserted on the same line and their values are 
summed the result is a modification of the factor associated with the line Iq into 
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Eoo / M y ">(v(l n )) \ 1 _ / 1 \ ( A~\ 0\ 

fc=0^i ' ^ u -v{l )* ) V vi u, .v{l )* - \ U vi ■ u> -v{l y-MW{v{l Q )) Uv 2) (Al.Z) 

The series defining involves, by construction, only trees with lines of scale 0, hence 

with large divisors so that it converges to a matrix of small size of order e (actually e 2 , 
looking more carefully) if e is small enough. 

Convergence can be established by simply remarking that the series defining is 
built with lines with values > \ of the propagator, so that it certainly converges for e 
small enough (by the estimates in Section 12 where the propagators were identically 1) 
and the sum is of order e (actually £ 2 ), hence < 1. However such an argument cannot 
be repeated when dealing with lines with smaller propagators (which still have to be 
discussed). Therefore a method not relying on so trivial a remark on the size of the 
propagators has eventually to be used when considering lines of scale higher than 1, as it 
will soon become necessary. 

The advantage of the collection of terms achieved with (Al.2) is that we can represent 
h as a sum of values of trees which are simpler because they contain no pair of lines of 
scale > 1 with in between lines of scale with total sum of the node harmonics vanishing. 
The price is that the divisors are now more involved and we even have a problem due 
to the fact that we have not proved that the series in (Al.2) converges. In fact it is a 
geometric series whose value is the r.h.s. of (Al.2) obtained by the sum rule (13.7) unless 
we can prove that the ratio of the geometric series is < 1. This is trivial in this case by the 
previous remark: but it is better to remark that there is anothr reason for convergence, 
whose use is not really necesary here but it will become essential later. 

The property that the ratio of the geometric series is < 1 can be regarded as due 
to the consequence of the cancellation mentioned in Section 14 which can be shown to 
imply that the ratio is < 1 because M^°'{y) = e 2 {ujq ■ v) 2 m( \v) with |m(°)(f)| < Do for 
some Dq > and for all |e| < £o for some £o : so that for small e the divisor in (Al.2) is 
essentially still what it was before starting the resummation. 

At this point an induction can be started. Consider trees evaluated with the new rule 
and place a scale lavel "> 2" on the lines with \ujq ■ u(l) \ < 2~ n+1 for n = 2: leave the label 
"0" on the lines already marked so and label by "1" the other lines. The lines of scale "1" 
will satisfy 2~ n < \u> • f (01 — 2 _n+1 for n = 1. And the graphs will now possibly contain 
lines of scale 0,1 or > 2 while lines with label "> 1 " no longer can appear, by construction. 

A cluster of scale 1 will be a maximal collection of lines of scales 0, 1 forming a 
connected subgraph of a tree 9 and containing at least one line of scale 1. 

The construction carried considering clusters of scale can be repeated by considering 
trees 6\ G ©i with ©i the collection of trees with lines marked 0, 1 or > 2 and in which 
no pairs of lines with equal momentum appear to follow each other if between them there 
are only lines marked or 1. 

Insertion of connected clusters 7 of such lines on a line Iq of 6\ leads to define a matrix 
jVf (!) formed by summing tree values of clusters 7 with lines of scales or 1 evaluated with 
the line factors defined in (Al.l) and with the restriction that in 7 there are no pairs of 
lines i < £' with the same current and which follow each other while any line between them 
has lower scale (i.e. 0), here between means preceding I' but not preceding /, as above. 
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Therefore a scale independent method has to devised to check convergence for 
and for the matrices to be introduced later to deal with even smaller propagators. This is 
achieved by the following extension of Siegel's theorem mentioned in Section 14: 

Let u>Q satisfy (13.2) and set u> = Cu>q. Consider the contribution to the sum in (14-3) 
from graphs 6 in which 

(1) no pairs I' > I of lines which lie on the same path to the root carry the same current 
v if all lines i\ between them have current v(t\) such that |u? ■ v(l{)\ > 2\u> ■ v\. 

(2) the node harmonics are bounded by \u\ < N for some N . 

Then the number of lines I in 6 with divisor u> ■ v>t satisfying 2~ n < |u> ■ U(\ < 2~ n+1 does 
not exceed 4 Nk 2~ n / T , n = 1, 2, . . .. 

This implies, by the same estimates in (14.6), that the series defining M*- 1 -* converges. 
Again it must be checked that there are cancellations implying that M^iy) = e 2 (u>o • 
v) 2 m( l \v) with \my-\u)\ < D for the same D > and the same Eq. 

At this point one deals with trees containing only lines carrying labels 0, 1, > 2 and 
the line factors for the lines I = v'v of scale are v v > ■ v v /(ujq ■ v(t)) 2 , those of the lines 
i = v'v of scale 1 have line factors v v > ■ (cl» • v{€) 2 — M^°\u{t}))~ 1 v v and those of the lines 
i = v'v of scale > 2 have line factors v v < ■ (a>o • v{€) 2 — M^ 1 \v{€)y)~ 1 v v . Furthermore no 
pair of lines of scale "1 " or of scale "> 2 " with the same momentum and with only lines 
of lower scale (i.e. of scale "0" in the first case or of scale "0","1" in the second) between 
then can follow each other. 

And so on until, after infinitely many steps, the problem is reduced to the evaluation 
of tree values in which each line carries a scale label n and there are no pairs of lines which 
follow each other and which have only lines of lower scale in between. Then the Siegel 
argument applies once more and the series so resummed is an absolutely convergent series 
of functions analytic in e: hence the original series is convergent. 

Although at each step there is a lower bound on the denominators it would not be 
possible to avoid using Siegel's theorem. In fact the lower bound would becomes worse and 
worse as the scale increases. In order to check the estimates of the constants D , e which 
control the scale independence of the convergence of the various series it is necessary to take 
advantage of the theorem, and of the absence at each step of the necessity of considering 
trees with pairs of consecutive lines with equal momentum and intermediate lines of higher 
scale. 

One could also perform the analysis by bounding order by order with no resum- 
mations (i.e. without changing the line factors) and exhibiting the necessary cancellations. 
Or the paths that KOLMOGOROV, Arnold and Moser used to prove the first three (some- 
what different) versions of the theorem, by successive approximations of th equations for 
the tori, can be followed. 

The invariant tori are Lagrangian manifolds just as the unperturbed ones (cf. com- 
ments after (6.4)) and, in the case of the Hamiltonian (14.1) the generating function 
A • if) + $(A, if)) can be expressed in terms of their parametric equations 
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$(A, t/0 = G(i/>) + a-if) + h(^) ■ (A - u> - Ah(VO) 



(A1.3) 




h(i/>)d4,Ah(if>) 



(27T) 



where A = (u> • d ^) and the invariant torus corresponds to A' = w in the map a. = xj) + 
d A $(A, ?/>) and A' = A + d^$(A, In fact by (Al.3) the latter becomes A' = A- Ah 
and, from the second of (13.3) written for / depending only on he angles a, it is A = u>+Ah 
when A, a: are on the ivariant torus. 

Note that if a exists it is necessarily determined by the third relation but the check 
that the second equation in (Al.3) is soluble (i.e. that the r.h.s. is an exact gradient up to 
a constant) is nontrivial. The canonical map generated by A • i/> + $(A, i/>) is also defined 
for A' close to uj and foliates the neighborhood of the invariant torus with other tori: of 
course for A' ^ uj the tori defined in this way are, in general, not invariant. 

References: [GBG94]. 

Appendix A2. Coriolis and Lorentz forces. Larmor precession 

Larmor precession is the part of the motion of an electrically charged particle caused 
by the action of a magnetic field H (in an inertial frame of reference). It is due to the 
Lorentz force which, on a unit mass with unit charge, produces an acceleration g = vAH 
if the speed of light is c = 1. 

Therefore if H = Hk is directed along the k axis the acceleration it produces is the 
same that the Coriolis force would impress on a unit mass located in a reference frame 
which rotates with angular velocity cuok around the k axis if H = 2cuok. 

The above remarks imply that a homogeneous sphere homogeneoulsy electrically 
charged with a unit charge and freely pivoting about its center in a constant magnetic 
field H directed along the k axis undergoes the same motion it would follow if not subject 
to the magnetic field but seen in a non inertial reference frame rotating at constant angu- 
lar velocity ujq around the k axis if H and ujq are related by H — 2ujq: in this frame the 
Coriolis force is interpreted as a magnetic field. 

This holds, however, only if the centrifugal force has zero moment with rerspect to 
the center: true in the spherical symmetry case only. In spherically non symmetric cases 
the centrifugal forces have in general non zero moment so the equivalence between Coriolis 
forces and magnetic fields is only approximate. 

The Larmor theorem makes this more precise. It gives a quantitative estimate of the 
difference between the motion of a general system of particles of mass m in a magnetic 
field and the motion of the same particles in a rotating frame of reference but in absence 
of a magnetic field. The approximation is estimated in terms of the size of the Larmor 
frequency eH/2mc : which should be small compared to the other characteristic frequencies 
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of the motion of the system: the physical meaning is that the centrifugal force should be 
small compared to the other forces. 

The vector potential A for a constant magnetic field in the k-direction H = 2cuok is 
A = 2cuok A g = 2cuo£ ± - Therefore, from the treatment of the Coriolis force in Section 
19, see (19.2), the motion of a charge e with mass m in a magnetic field H with vector 
potential A and subject to other forces with potential W can be described, in an inertial 
frame and in generic units in which the speed of light is c, by a Hamiltonian 

n= rL {p -- c A)2+w{e) {A2A) 

where p = rag + | A and g are canonically conjugated. 
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